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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CGAGGCACCG 
ACCATGAAGA 
GCTGCGGCCG 
GTCGTCTTCG 
GCCCAGTTGA 
GGCAGTCTGG 
AAGAAGGCCG 
GCGGCCGCCG 
GTCAC GCAG A 
ATGGAGTTGC 
GCTACGCCGC 
GTNTGCGCAG 
GNCAC CAGNG 



GTAGTTTGAA 
TGGTGAAATC 
GTGTGACTTC 
GCGCGCCACT 
CCAGCCTGCT 
TCGAGGGCGG 
CCGAGCACGG 
GTTCGGCCAC 
ACGTCACGTT 
TGCAGGCCGC 
CCGCCTGGTG 
GGNCGCACGC 
ANCACCCCCN 



CCAAACGCAC 
GATCGCCGCA 
GATCATGGCT 
GCCGTTGGAC 
CAACAGCCTC 
CATCGGGGGC 
GGATCTGCCG 
CGCCGACGTT 
CGTGAATCAA 
AGGGNAACTG 
ACGCGTCCAT 
ACCGCCCGGT 
NNTCGNCNNT 



AATCGACGGG CAAACGAACG GAAGAACACA 
GGTCTGACCG CCGCGGCTGC AATCGGCGCC 
GGCGGCCCGG TCGTATACCA GATGCAGCCG 
CCGGCATCCG CCCCTGACGT CCCGACCGCC 
GCCGATCCCA ACGTGTCGTT TGCGAACAAG 
ACCGAGGCGC GCATCGCCGA CCACAAGCTG 
CTGTCGTTCA GCGTGACGAA CATCCAGCCG 
TCCGTCTCGG GTCCGAAGCT CTCGTCGCCG 
GGCGGCTGGA TGCTGTCACG CGCATCGGCG 
ATTGGCGGGC CGGNTTCAGC CCGCTGTTCA 
GTCGAACACT CGCGCGTGTA GCACGGTGCG 
GCAAGCCGTC CTCGAGATAG GTGGTGNCTC 
TCTCGNTGNT GNATGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
766 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGCATCACC ATCACCATCA CGATGAAGTC ACGGTAGAGA CGACCTCCGT CTTCCGCGCA 60 

GACTTCCTCA GCGAGCTGGA CGCTCCTGCG CAAGCGGGTA CGGAGAGCGC GGTCTCCGGG 120 

GTGGAAGGGC TCCCGCCGGG CTCGGCGTTG CTGGTAGTCA AACGAGGCCC CAACGCCGGG 180 

TCCCGGTTCC TACTCGACCA AGCCATCACG TCGGCTGGTC GGCATCCCGA CAGCGACATA 240 

TTTCTCGACG ACGTGACCGT GAGCCGTCGC CATGCTGAAT TCCGGTTGGA AAACAACGAA 300 

TTCAATGTCG TCGATGTCGG GAGTCTCAAC GGCAC CTACG TCAACCGCGA GCCCGTGGAT 360 

TCGGCGGTGC TGGCGAACGG CGACGAGGTC CAGATCGGCA AGCTCCGGTT 'GGTGTTCTTG 420 

ACCGGACCCA AGCAAGGCGA GGATGACGGG AGTACCGGGG GCCCGTGAGC GCACCCGATA 480 

GCCCCGCGCT GGCCGGGATG TCGATCGGGG CGGTCCTCCG ACCTGCTACG ACCGGATTTT 540 

CCCTGATGTC CACCATCTCC AAGATTCGAT TCTTGGGAGG CTTGAGGGTC NGGGTGACCC 600 

CCCCGCGGGC CTCATTCNGG GGTNTCGGCN GGTTTCACCC CNTACCNACT GCCNCCCGGN 660 

TTGCNAATTC NTTCTTCNCT GCCCNNAAAG GGACCNTTAN CTTGCCGCTN GAAANGGTNA 720 

TCCNGGGCCC NTCCTNGAAN CCCCNTCCCC CT 752 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CATATGCATC ACCATCACCA TCACACTTCT AACCGCCCAG CGCGTCGGGG GCGTCGAGCA 60 
CCACGCGACA CCGGGCCCGA TCGATCTGCT AGCTTGAGTC TGGTCAGGCA TCGTCGTCAG 120 
CAGCGCGATG CCCTATGTTT GTCGTCGACT CAGATATCGC GGCAATCCAA TCTCCCGCCT 18 0 



73 



GCGGC CGGCG GTGCTGCAAA CTACTCCCGG 
ATGCTGGTCA CGGCTGTCGT TTTGCTCTGT 
ACCTACTGCG AGGAGTTGAA AGGCACCGAT 
GACCCGGCCT ACAACATCAA CATCAGCCTG 
GAAAATTACA TCGCCCAGAC GCGCGACAAG 
CGCGAAGCCC CCTACGAATT GAATATCACC 
CGTGGTACGC AGGCCGTGGT GCTCAMGGTC 
ACCACGTACA AGGCCTTCGA TTGGGACCAG 
CTGTGGCAGG CTGACACCGA TCCGCTGCCA 
GAGCAACGCA GACCGGGACA ACWGGTATCG 
TGAAATTATC ACAACTTCGC AGTCACNAAA 



AGGAATTTCG ACGTGCGCAT CAAGATCTTC 
TGTTCGGGTG TGGCCACGGC CGCGCCCAAG 
ACCGGCCAGG CGTGCCAGAT TCAAATGTCC 
CCCAGTTACT ACCCCGACCA GAAGTCGCTG 
TTCCTCAGCG CGGCCACATC GTCCACTCCA 
TCGGCCACAT ACCAGTCCGC GATACCGCCG 
TACCACAACG CCGGCGGCAC GCACCCAACG 
GCCTATCGCA AGCCAATCAC CTATGACACG 
GTCGTCTTCC CCATTGTTGC AAGGTGAACT 
ATAGCCGCCN AATGCCGGCT TGGAACCCNG 
NAA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
813 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGGTATGAAC ACGGCCGCGT CCGATAACTT CCAGCTGTCC CAGGGTGGGC AGGGATTCGC 60 

CATTCCGATC GGGCAGGCGA TGGCGATCGC GGGCCAGATC CGATCGGGTG GGGGGTCACC 120 

CACCGTTCAT ATCGGGCCTA CCGCCTTCCT CGGCTTGGGT GTTGTCGACA ACAACGGCAA 18 0 

CGGCGCACGA GTCCAACGCG TGGTCGGGAG CGCTCCGGCG GCAAGTCTCG GCATCTCCAC 240 

CGGCGACGTG ATCACCGCGG TCGACGGCGC TCCGATCAAC TCGGCCACCG CGATGGCGGA 300 

CGCGCTTAAC GGGCAT CATC CCGGTGACGT CATCTCGGTG AACTGGCAAA CCAAGTCGGG 360 

CGGCACGCGT ACAGGGAACG TGACATTGGC CGAGGGACCC CCGGCCTGAT TTCGTCGYGG 420 

ATACCACCCG CCGGCCGGCC AATTGGA 447 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTCCCACTGC GGTCGCCGAG TATGTCGCCC AGCAAATGTC TGGCAGCCGC CCAACGGAAT 60 

CCGGTGATCC GACGT CGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 12 0 

AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 18 0 

CCGG CGACGG NGAGCGCCGG AATGGCGCGA GTGAGGAGGT GGNCAGTCAT GCCCAGNGTG 240 

ATCCAATCAA CCTGNATTCG GNCTGNGGGN CCATTTGACA ATCGAGGTAG TGAGCGCAAA 3 00 

TGAATGATGG AAAACGGGNG GNGACGTCCG NTGTTCTGGT GGTGNTAGGT GNCTGNCTGG 360 

NGTNGNGGNT ATCAGGATGT TCTTCGNCGA AANCTGATGN CGAGGAACAG GGTGTNCCCG 420 

NNANNCCNAN GGNGTCCNAN CCCNNNNTCC TCGNCGANAT CANANAGNCG NTTGATGNGA 48 0 

NAAAAGGGTG GANCAGNNNN AANTNGNGGN CCNAANAANC NNNANNGNNG NNAGNTNGNT 540 

NNNTNTTNNC ANNNNNNNTG NNGNNGNNCN NNNCAANCNN NTNNNNGNAA NNGGNTTNTT 600 

NAAT 604 

(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



TTGCANGTCG AACCACCTCA CTAAAGGGAA CAAAAGCTNG AGCTCCACCG CGGTGGCGGC 60 

CGCTCTAGAA CTAGTGKATM YYYCKGGCTG CAGSAATYCG GYACGAGCAT TAGGACAGTC 120 

TAACGGTCCT GTTACGGTGA TCGAATGACC GACGACATCC TGCTGATCGA CACCGACGAA 180 

CGGGTGCGAA CCCTCACCCT CAACCGGCCG CAGTCCCGYA ACGCGCTCTC GGCGGCGCTA 240 

CGGGATCGGT TTTTCGCGGY GTTGGYCGAC GCCGAGGYCG ACGACGACAT CGACGTCGTC 300 

ATCCTCACCG GYGCCGATCC GGTGTTCTGC GCCGGACTGG ACCTCAAGGT AGCTGGCCGG 360 

GCAGACCGCG CTGCCGGACA TCTCACCGCG GTGGGCGGCC ATGACCAAGC CGGTGATCGG 420 

CGCGATCAAC GGCGC CGCGG TCACCGGCGG GCTCGAACTG GCGCTGTACT GCGACATCCT 480 

GATCGCCTCC GAGCACGCCC GCTTCGNCGA CACCCACGCC CGGGTGGGGC TGCTGCCCAC 540 

CTGGGGACTC AGTGTGTGCT TGCCGCAAAA GGTCGGCATC GGNCTGGGCC GGTGGATGAG 600 

CCTGACCGGC GACTACCTGT CCGTGACCGA CGC 633 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGACGACGAC GGCGCCGGAG AGCGGGCGCG AACGGCGATC GACGCGGCCC TGGCCAGAGT 60 

CGGCACCACC CAGGAGGGAG TCGAATCATG AAATTTGTCA ACCATATTGA GCCCGTCGCG 120 

CCCCGCCGAG CCGGCGGCGC GGTCGCCGAG GTCTATGCCG AGGCCCGCCG CGAGTTCGGC 180 

CGGCTGCCCG AGCCGCTCGC CATGCTGTCC CCGGACGAGG GACTGCTCAC CGCCGGCTGG 240 

GCGACGTTGC GCGAGACACT GCTGGTGGGC CAGGTGCCGC GTGGCCGCAA GGAAGCCGTC 300 

GCCGCCGCCG TCGCGGCCAG CCTGCGCTGC CCCTGGTGCG TCGACGCACA CACCACCATG 360 

CTGTACGCGG CAGGCCAAAC CGACACCGCC GCGGCGATCT TGGCCGGCAC AGCACCTGCC 420 

GCCGGTGACC CGAACGCGCC GTATGTGGCG TGGGCGGCAG GAACCGGGAC ACCGGCGGGA 480 

CCGCCGGCAC CGTTCGGCCC GGATGTCGCC GCCGAATACC TGGGCACCGC GGTGCAATTC 540 

CACTTCATCG CACGCCTGGT CCTGGTGCTG CTGGACGAAA CCTTCCTGCC GGGGGGCCCG 600 

CGCGCCCAAC AGCTCATGCG CCGCGCCGGT GGACTGGTGT TCGCCCGCAA GGTGCGCGCG 660 

GAGCATCGGC CGGGCCGCTC CACCCGCCGG CTCGAGCCGC GAACGCTGCC CGACGATCTG 720 

GCATGGGCAA CACCGTCCGA GCCCATAGCA ACCGCGTTCG CCGCGCTCAG CCACCACCTG 780 - 

GACACCGCGC CGCACCTGCC GCCACCGACT CGTCAGGTGG TCAGGCGGGT CGTGGGGTCG 840 

TGGCACGGCG AGCCAATGCC GATGAGCAGT CGCTGGACGA ACGAGCACAC CGCCGAGCTG 900 

CCCGCCGACC TGCACGCGCC CACCCGTCTT GCCCTGCTGA CCGGCCTGGC CCCGCATCAG 960 

GTGACCGACG ACGACGTCGC CGCGGCCCGA TCCCTGCTCG ACACCGATGC GGCGCTGGTT 1020 

GGCGCCCTGG CCTGGGCCGC CTTCACCGCC GCGCGGCGCA TCGGCACCTG GATCGGCGCC 1080 

GCCGCCGAGG GCCAGGTGTC GCGGCAAAAC CCGACTGGGT GAGTGTGCGC GCCCTGTCGG 1140 

TAGGGTGTCA TCGCTGGCCC GAGGGATCTC GCGGCGGCGA ACGGAGGTGG CGACACAGGT 1200 

GGAAGCTGCG CCCACTGGCT TGCGCCCCAA CGCCGTCGTG GGCGTTCGGT TGGCCGCACT 1260 

GGCCGATCAG GTCGGCGCCG GCCCTTGGCC GAAGGTCCAG CTCAACGTGC CGTCACCGAA 1320 

GGACCGGACG GTCACCGGGG GTCACCCTGC GCGCCCAAGG AA 1362 



(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



GCGACGACCC CGATATGCCG GGCACCGTAG CGAAAGCCGT CGCCGACGCA CTCGGGCGCG 60 

GTATCGCTCC CGTTGAGGAC ATTCAGGACT GCGTGGAGGC CCGGCTGGGG GAAGCCGGTC 120 

TGGATGACGT GGCC CGTGTT TACATCATCT ACCGGCAGCG GCGCGCCGAG CTGCGGACGG 180 

CTAAGGCCTT GCTCGGCGTG CGGGACGAGT TAAAGCTGAG CTTGGCGGCC GTGACGGTAC 240 

TGCGCGAGCG CTATCTGCTG CACGACGAGC AGGGCCGGCC GGCCGAGTCG ACCGGCGAGC 300 

TGATGGACCG ATCGGCGCGC TGTGTCGCGG CGGCCGAGGA CCAGTATGAG CCGGGCTCGT 360 

CGAGGCGGTG GGCCGAGCGG TTCGCCACGC TATTACGCAA CCTGGAATTC CTGCCGAATT 420 

CGCCCACGTT GATGAACTCT GGCACCGACC TGGGACTGCT CGCCGGCTGT TTTGTTCTGC 480 

CGATTGAGGA TTCG CTGCAA TCGATCTTTG CGACGCTGGG ACAGGCCGCC GAGCTGCAGC 540 

GGGCTGGAGG CGGCACCGGA TATGCGTTCA GCCACCTGCG ACCCGCCGGG GATCGGGTGG 600 

CCTCCACGGG CGGCACGGCC AGCGGACCGG TGTCGTTTCT ACGGCTGTAT GACAGTGCCG 660 

CGGGTGTGGT CTCCATGGGC GGTCGCCGGC GTGGCGCCTG TATGGCTGTG CTTGATGTGT 720 

CGCACCCGGA TATC TGTGAT TTCGTCACCG CCAAGGCCGA ATCCCCCAGC GAGCTCCCGC 780 

ATTTCAACCT ATCGGTTGGT GTGACCGACG CGTTCCTGCG GGCCGTCGAA CGCAACGGCC 840 

TACACCGGCT GGTCAATCCG CGAACCGGCA AGATCGTCGC GCGGATGCCC GCCGCCGAGC 900 

TGTTCGACGC CATCTGCAAA GCCGCGCACG CCGGTGGCGA TCCCGGGCTG GTGTTTCTCG 960 

ACACGATCAA TAGGGCAAAC CCGGTGCCGG GGAGAGGCCG CATCGAGGCG ACCAACCCGT 1020 

GCGGGGAGGT CCCACTGCTG CCTTACGAGT CATGTAATCT CGGCTCGATC AACCTCGCCC 1080 

GGATGCTCGC CGAC GGTCGC GTCGACTGGG ACCGGCTCGA GGAGGTCGCC GGTGTGGCGG 1140 

TGCGGTTCCT TGATGACGTC ATCGATGTCA GCCGCTACCC CTTCCCCGAA CTGGGTGAGG 1200 

CGGCCCGCGC CACCCGCAAG ATCGGGCTGG GAGTCATGGG TTTGGCGGAA CTGCTTGCCG 1260 

CACTGGGTAT TCCGTACGAC AGTGAAGAAG CCGTGCGGTT AGCCACCCGG CTCATGCGTC 1320 

GCATACAGCA GGCGGCGCAC ACGGCATCGC GGAGGCTGGC CGAAGAGCGG GGCGCATTCC 1380 

CGGCGTTCAC CGATAGCCGG TTCGCGCGGT CGGGCCCGAG GCGCAACGCA CAGGTCACCT 1440 

CCGTCGCTCC GACGGGCA 1458 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ACGGTGTAAT CGTGCTGGAT CTGGAACCGC GTGGCCCGCT ACCTACCGAG ATCTACTGGC 60 

GGCGCAGGGG GCTGGCCCTG GGCATCGCGG TCGTCGTAGT CGGGATCGCG GTGGCCATCG 120 

TCATCGCCTT CGTCGACAGC AGCGCCGGTG CCAAACCGGT CAGCGCCGAC AAGCCGGCCT 180 

CCGCCCAGAG CCATCCGGGC TCGCCGGCAC CCCAAGCACC CCAGCCGGCC GGGCAAACCG 24 0 

AAGGTAACGC CGCCGCGGCC CCGCCGCAGG GCCAAAACCC CGAGACACCC ACGCCCACCG 30 0 

CCGCGGTGCA GCCGCCGCCG GTGCTCAAGG AAGGGGACGA TTGCCCCGAT TCGACGCTGG 360 

CCGTCAAAGG TTTGACCAAC GCGCCGCAGT ACTACGTCGG CGACCAGCCG AAGTTCACCA 42 0 

TGGTGGTCAC CAACATCGGC CTGGTGTCCT GTAAACGCGA CGTTGGGGCC GCGGTGTTGG 480 

CCGCCTACGT TTACTCGCTG GACAACAAGC GGTTGTGGTC CAACCTGGAC TGCGCGCCCT 540 

CGAATGAGAC GCTGGTCAAG ACGTTTTCCC CCGGTGAGCA GGTAACGACC GCGGTGACCT 600 



76 



GGACCGGGAT GGGATCGGCG CCGCGCTGCC CATTGCCGCG GCCGGCGATC GGGCCGGGCA 

CCTACAATCT CGTGGTACAA CTGGGCAATC TGCGCTCGCT GCCGGTTCCG TTCATCCTGA 

ATCAGCCGCC GCCGCCGCCC GGGCCGGTAC CCGCTCCGGG TCCAGCGCAG GCGCCTCCGC 

CGGAGTCTCC CGCGCAAGGC GGATAATTAT TGATCGCTGA TGGTCGATTC CGCCAGCTGT 
GACAACCCCT CGCCTCGTGC CG 



660 



720 
780 
840 
862 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC CAATGACAAA 60 

GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC GAACGCTGGA 120 

GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG CGCGGACGCG 180 

TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC CTTTCAGGAT 240 

CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA GTGATGAAGG 300 

TCGCCGCGCA GTGTT CAAAG CTCGGATATA CGGTGGCACC CATGGAACAG CGTGCGGAGT 360 

TGGTGGTTGG CCGGGCACTT GTCGTCGTCG TTGACGATCG CACGGCGCAC GGCGATGAAG 420 

ACCACAGCGG GCCGCTTGTC ACCGAGCTGC TCACCGAGGC CGGGTTTGTT GTCGACGGCG 480 

TGGTGGCGGT GTCGGCCGAC GAGGTCGAGA TCCGAAATGC GCTGAACACA GCGGTGATCG 540 

GCGGGGTGGA CCTGGTGGTG TCGGTCGGCG GGACCGGNGT GACGNCTCGC GATGTCACCC 600 

CGGAAGCCAC CCGNGACATT CT 622 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCGCAGCGG TAAGCCTGTT GGCCGCCGGC ACACTGGTGT TGACAGCATG CGGCGGTGGC 60 

ACCAACAGCT CGTCGTCAGG CGCAGGCGGA ACGTCTGGGT CGGTGCACTG CGGCGGCAAG 120 

AAGGAGCTCC ACTCCAGCGG CTCGACCGCA CAAGAAAATG CCATGGAGCA GTTCGTCTAT 180 

GCCTACGTGC GATCGTGCCC GGGCTACACG TTGGACTACA ACGCCAACGG GTCCGGTGCC 240 

GGGGTGACCC AGTTTCTCAA CAACGAAACC GATTTCGCCG GCTCGGATGT CCCGTTGAAT 300 

CCGTCGACCG GTCAACCTGA CCGGTCGGCG GAGCGGTGCG GTTCCCCGGC ATGGGAC CTG 360 

CCGACGGTGT TCGGCCCGAT CGCGATCACC TACAATATCA AGGGCGTGAG CACGCTGAAT 42 0 

CTTGACGGAC CCACTACCGC CAAGATTTTC AACGGCACCA TCACCGTGTG GAATGATCCA 480 

CAGATCCAAG CCCTCAACTC CGGCACCGAC CTGCCGCCAA CACCGATTAG CGTTATCTTC 540 

CGCAGCGACA AGTCCGGTAC GTCGGACAAC TTCCAGAAAT ACCTCGACGG TGTATCCAAC 600 

GGGGCGTGGG GCAAAGGCGC CAGCGAAACG TTCAGCGGGG GCGTCGGCGT CGGCGCCAGC 660 

GGGAACAACG GAACGTCGGC C CTACTGCAG ACGACCGACG GGTCGATCAC CTACAACGAG 720 

TGGTCGTTTG CGGTGGGTAA GCAGTTGAAC ATGGCCCAGA TCATCACGTC GGCGGGTCCG 780 

GATCCAGTGG CGATCACCAC CGAGTCGGTC GGTAAGACAA TCGCCGGGGC CAAGATCATG 840 

GGACAAGGCA ACGACCTGGT ATTGGACACG TCGT CGTTCT ACAGACCCAC CCAGCCTGGC 900 

TCTTACCCGA TCGTGCTGGC GACCTATGAG ATCGTCTGCT CGAAATACCC GGATGCGACG 960 

ACCGGTACTG CGGTAAGGGC GTTTATGCAA GCCGCGATTG GTCCAGGCCA AGAAGGCCTG 1020 
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GACCAATACG GCTCCATTCC GTTGCCCAAA TCGTTCCAAG CAAAATTGGC GGCCGCGGTG 1080 
AATGCTATTT CTTGACCTAG TGAAGGGAAT TCGACGGTGA GCGATGCCGT TCCGCAGGTA 1140 
GGGTCGCAAT TTGGGCCGTA TCAGCTATTG CGGCTGCTGG GCCGAGGCGG GATGGGCGAG 1200 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GCAAGCAGCT 
AGACCGGCTA 
CGTTTCTGCA 
GGTTGCTCCA 
CCGCGACCGG 
ACGCGGGCCG 
CGGCCGACTA 
TCATCGAGGC 
GTGTGCCCAT 
GGCTGGTTTA 
AAGC CAACGA 
GCGCCGTAGT 
GCTACCTGCC 
CGCTGAACGC 
TCGACAAGGC 
AGCTGCTGTT 
AGATGGGCGG 
GCTGGGACGC 
TTTCCGCCCT 
CGAGTAGCCT 



GCAGGTCGTG 
CACCACGGAT 
ACATCTGCTC 
AGCGGTGGCC 

GCGGATCCGG 
CAGCCAGATC 
GTTCAACACC 
CGACGAGGTC 
CGGGTTGAGC 
GCAGATGGAC 
CGAGCGGGCC 
CGAGCTGGAC 
GCCGATCCAG 
GCTCAACGAG 
CGAAATCGCC 
CGCTTACCCG 
GGCGGCGCAC 
GAGTTCACGC 
CGTCA 



CTGTTCGACG 
GCCGACGCGC 
GCCCACCGCG 
GCCGACGGCC 
TCGACCGAAC 
GACGCGTTCG 
GAGATGCGGA 
GGGGAGGACC 
AC CGGCGAGT 
GCCTACGGCC 
GCGTATTTCG 
CGCAAGGACG 
AGCAGCAACC 
GGCAGCGCGG 
GCACAGCTGG 
CCCGGTGAAC 
CTCGACGTCC 
TGAGTGCCGA 
TCGGCGCAAT 



AACTGGGCAT 
TGCAGTCGTT 
ACGTCACCCG 
GCATCCACAC 
CCAACCTGCA 
TGGTCGGGGA 
TCATGGGGCA 
TGTATTCGTT 
TGCGGCGCCG 
TGTCGCAGCA 
CCCGATTCGG 
GCTACACCTC 
GTCAAGTGCG 
CCGACATCAT 
CGTCGCGCAT 
GCGAGCGGGT 
CGCTGGAGGT 
GCGTGCATCT 
CGGGACCGAG 



GCCGAAGACC 
GTTCGACAAG 
GCTCAAGGTC 
CACGTTCAAC 
GAACATCCCG 
CGGTTACGCC 
CCTGTCCGGG 
CGTCGCGTCC 
GGTCAAGGCG 
GTTGAAAATC 
CGGGGTGCGC 
GACGGTGCTG 
GGAGGC CGCC 
CAAGGTGGCC 
GCTGCTGCAG 
CGAGGCCCTG 
GTCGGTGGGC 
GGGGCGGGAA 
TTTGTCCAGC 



AAACGCACCA 

ACCGGGCATC 

ACCGTCGACG 

CAGACGATCG 

ATCCGCACCG 

GAGTTGATGA ' 

GACGAGGGCC 

CGGGTGTTCG 

ATGTCCTACG 

TCCACCGAGG 

GACTACCTGC 

GGCCGTCGCC 

GAGCGGGCGG 

ATGATC CAGG 

GTCCACGACG 

GTGCGCGACA 

TACGGCCGCA 

TTCGGCGATT 

GTGTACCCGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
96 0 
1020 
1080 
1140 
1155 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



GAGCGCCGTC 
TCGGGCCTCG 
ATCGAAGACA 
GGCCGTCGGA 
ATCGAGAACT 
ACCTAGTTGT 
GGCCCGAGTA 
GTATTCGCCA 
GTACAGCCAG 
CCGTCAACCC 



TGGTGTTTGA 
GGTTGGCGAT 
CCGACCCAGG 
TGCCGATTCC 
CTCGGGGTTC 
GCAGTTACTG 
GTGGGCCTAG 
CCGCCGCAGC 
CAGTTCGACT 
TACGAGGCGT 



ACGGTTTTAC 
CGTCAAACAG 
CGGCCAGCCC 
GCAGCTTCCC 
GGCGAACGTT 
TTGAAAGCCA 
TACAGGAAGA 
AGCCGGGAAC 
GGCGTTACCC 
TGGGTGGTAC 



CGGTCGGCAT 
GTGGTGCTCA 
CCTGGAACGT 
GGTGCGACGG 
ATCTCAGTGG 
CACCCATGCC 
GCAACCTAGC 
CCCAGGTTAT 
ACCGTCCCCG 
CCGGCCGGGT 



CGGCACGGGC 
ACCACGGCGG 
CGATTTACGT 
CTGGCG CTCG 
AATCTCAGTC 
AGTCCACGCA 
GACATGACGA 
GCTCAGGGGC 
CCCCCGCAGC 
CTGATAC CTG 



GTTGCCGGGT 
ATTGCTGCGC 
GCTGCTCCCC 
GAGCACGGAC 
CACGCGCGCA 
TGGCCAAGTT 
ATCACCCACG 
AGCAGCAAAC 
CAACCCAGTA 
GCGTGATTCC 



120 
180 
240 
300 
360 
420 
480 
540 
600 
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GACCATGACG 
CGGCGCGGTG 
CGGGTTCAAC 
AAGCATCCCC 
GCCCAGTGTC 
CATTCTGTCT 



CCCCCTCCTG 
ACGATAGCGG 
CGGGCACCCG 
GCAGCAAACA 
GTCATGTTGG 
GCCGAGGGGC 



GGATGGTTCG CCAACGCCCT CGTGCAGGCA TGTTGGCCAT 
TGGTGTCCGC CGGCATCGGC GGCGCGGCCG CATCCCTGGT 
CCGGCCCCAG CGGCGGCCCA GTGGCTGCCA GCGCGGCGCC 
TGCCGCCGGG GTCGGTCGAA CAGGTGGCGG CCAAGGTGGT 
AAACCGATCT GGGCCGCCAG TCGGAGGAGG GCTCCGGCAT 
TGATCTTGAC CAACAACCAC GTGATCGCGG CGGCCGCCAA 



660 
720 
780 
840 
900 
960 



GCCTCCCCTG GGCAGTC CGC CGCCGAAAAC GACGGTAACC TTCTCTGACG GGCGGACCGC 102 0 

ACCCTTCACG GTGGTGGGGG CTGACCCCAC CAGTGATATC GCCGTCGTCC GTGTTCAGGG 108 0 

CGTCTCCGGG CTCACCCCGA TCTCCCTGGG TTCCTCCTCG GACCTGAGGG TCGGTCAGCC 1140 

GGTGCTGGCG ATCGGGTCGC CGCTCGGTTT GGAGGGCACC GTGACCACGG GGATCGTCAG 1200 

CGCTCTCAAC CGTCCAGTGT CGACGACCGG CGAGGCCGGC AACCAGAACA CCGTGCTGGA 1260 

CGCCATTCAG ACCGACGCCG CGATCAACCC CGGTAACTCC GGGGGCGCGC TGGTGAACAT 1320 

GAACGCTCAA CTCGTCGGAG TCAACTCGGC CATTGCCACG CTGGGCGCGG ACTCAGCCGA 1380 

TGCGCAGAGC GGCTCGATCG GTCTCGGTTT TGCGATTCCA GTCGACCAGG CCAAGCGCAT 1440 

CGCCGACGAG TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC 1500 

CAATGACAAA GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC 1560 

GAACGCTGGA GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG 1620 

CGCGGACGCG TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC 168 0 

CTTTCAGGAT CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA 1740 

GTGATGAAGG TCGCCGCGCA GTGTTCAAAG C 1771 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CTCCACCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA GGAATTCGGC 60 

ACGAGGATCC GACGTCGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 120 

AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 180 

CCGGCGACGG CGAGCGCCGG AATGGCGCGA GTGAGGAGGC GGGCAGTCAT GCCCAGCGTG 240 

ATCCAATCAA CCTGCATTCG GCCTGCGGGC CCATTTGACA ATCGAGGTAG TGAGCGCAAA 3 00 

TGAATGATGG AAAACGGGCG GTGACGTCCG CTGTTCTGGT GGTGCTAGGT GCCTGCCTGG 360 

CGTTGTGGCT ATCAGGATGT TCTTCGCCGA AACCTGATGC CGAGGAACAG GGTGTTCCCG 420 

TGAGCCCGAC GGCGTCCGAC CCCGCGCTCC TCGCCGAGAT CAGGCAGTCG CTTGATGCGA 480 

CAAAAGGGTT GACCAGCGTG CACGTAGCGG TCCGAACAAC CGGGAAAGTC GACAGCTTGC 540 

TGGGTATTAC CAGTG CCGAT GTCGACGTCC GGGCCAATCC GCTCGCGGCA AAGGGCGTAT 600 

GCACCTACAA CGACGAGCAG GGTGTCCCGT TTCGGGTACA AGGC GACAAC ATCTCGGTGA 660 

AACTGTTCGA CGACTGGAGC AATCTCGGCT CGATTTCTGA ACTGTCAACT TCACGCGTGC 720 

TCGATCCTGC CGCTGGGGTG ACGCAGCTGC TGTCCGGTGT CACGAACCTC CAAGCGCAAG 780 

GTACCGAAGT GATAGACGGA ATTTCGACCA CCAAAATCAC CGGGACCATC CCCGCGAGCT 840 

CTGTCAAGAT GCTTGATCCT GGCGCCAAGA GTGCAAGGCC GGCGACCGTG TGGATTGCCC 900 

AGGACGGCTC GCACCACCTC GTCCGAGCGA GCATCGACCT CGGATCCGGG TCGATTCAGC 960 

TCACGCAGTC GAAATGGAAC GAACCCGTCA ACGTCGACTA GGC CGAAGTT GCGTCGACGC 1020 

GTTGNTCGAA ACGCCCTTGT GAACGGTGTC AACGGNAC 1058 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



GAATTCGGCA CGAGAGGTGA TCGACATCAT 
GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA 
CATTGAGCAG GACATGGCCG TGGACAGCGC 
AGTGTCGTTC AAGATGAGGC CGGCGCAACC 
AAATCGCACG GTTTGCGGTT GATTCGTGCG 
GCGCGGCCCA GGTCCGCGTG CTGCCGTATC 
CCGGAGTTAA TGCTTCGCGT CGACCCGAAC 
CCGTGGCCAG CCCGTCGATG CCCGAGTTGC 
AGCGTCCGTA GGCGGCGGTG CTGACCGGCT 
GG 



CGGGACCAGC CCCACATCCT GGGAACAGGC 
TAGCGTCGAT GACATCCGCG TCGCTCGGGT 
CGGCAAGATC ACCTACCGCA TCAAGCTCGA 
GCGCTAGCAC GGGCCGGCGA GCAAGACGCA 
ATTTTGTGTC TGCTCGCCGA GGCCTACCAG 
CAGGCGTGCA TCGCGATTCC GGCGGCCACG 
TGGGCGATCC GCCGGNGAGC TGATCGATGA 
CCGAGGAAAC GTGCTGCCAG GCCGGTAGGA 
CTGCCTGCGC CCTCAGTGCG GCCAGCGAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
542 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

CGGTGCCGCC CGCGCCTCCG TTGCCCCCAT TGCCGCCGTC GCCGATCAGC TGCGCATCGC 60 

CACCATCACC GCCTTTGCCG CCGGCACCGC CGGTGGCGCC GGGGCCGCCG ATGCCACCGC 120 

TTGACCCTGG CCGCCGGCGC CGCCATTGCC ATACAGCACC CCGCCGGGGG CACCGTTACC 180 

GCCGTCGCCA CCGTCGCCGC CGCTGCCGTT TCAGGCCGGG GAGGCCGAAT GAACCGCCGC 240 

CAAGCCCGCC GCCGGCACCG TTGCCGCCTT TTCCGCCCGC CCCGCCGGCG CCGCCAATTG 300 

CCGAACAGCC AMGCACCGTT GCCGCCAGCC CCGCCGCCGT TAACGGCGCT GCCGGGCGCC 360 

GCCGCCGGAC CCGCCATTAC CGCCGTTCCC GTTCGGTGCC CCGCCGTTAC CGGCGCCGCC 420 

GTTTGCCGCC AATATTCGGC GGGCACCGCC AGACCCGCCG GGGC CAC CAT TGCCGCCGGG 480 

CACCGAAACA ACAGCCCAAC GGTGCCGCCG GCCCCGCCGT TTGCCGCCAT CACCGGCCAT 540 

TCACCGCCAG CACCGCCGTT AATGTTTATG AACCCGGTAC CGCCAGCGCG GCCCCTATTG 600 

CCGGGCGCCG GAGNGCGTGC CCGCCGGCGC CGCCAACGCC CAAAAGCCCG GGGTTGCCAC 660 

CGGCCCCGCC GGACCCACCG GTCCCGCCGA TCCCCCCGTT GCCGCCGGTG CCGCCGCCAT 720 

TGGTGCTGCT GAAGC CGTTA GCGCCGGTTC CGCSGGTTCC GGCGGTGGCG CCNTGGCCGC 780 

CGGCCCCGCC GTTGCCGTAC AGCCACCCCC CGGTGGCGCC GTTGCCGCCA TTGCCGCCAT 840 

TGCCGCCGTT GCCGCCATTG CCGCCGTTCC CGCCGCCACC GCCGGNTTGG CCGCCGGCGC 900 

CGCCGGCGGC CGC 913 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GACTACGTTG GTGTAGAAAA ATCCTGCCGC CCGGACCCTT AAGGCTGGGA CAATTTCTGA 60 
TAGCTACCCC GACACAGGAG GTTACGGGAT GAGCAATTCG CGCCGCCGCT CACTCAGGTG 120 
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GTCATGGTTG CTGAGCGTGC TGGCTGCCGT CGGGCTGGGC CTGGCCACGG CGCCGGCCCA 180 

GGCGGCCCCG CCGGCCTTGT CGCAGGACCG GTTCGCCGAC TTCCCCGCGC TGCCCCTCGA 240 

CCCGTCCGCG ATGGTCGCCC AAGTGGCGCC ACAGGTGGTC AACATCAACA CCAAACTGGG 300 

CTACAACAAC GCCGTGGGCG CCGGGACCGG CATCGTCATC GATCCCAACG GTGTCGTGCT 360 

GACCAACAAC CACGTGATCG CGGGCGCCAC CGACATCAAT GCGTTCAGCG TCGGCTCCGG 420 

CCAAACCTAC GGCGTCGATG TGGTCGGGTA TGACCGCACC CAGGATGTCG CGGTGCTGCA 480 

GCTGCGCGGT GCCGGTGGCC TGCCGTCGGC GGCGATCGGT GGCGGCGTCG CGGTTGGTGA 540 

GCCCGTCGTC GCGATGGGCA ACAGCGGTGG GCAGGGCGGA ACGCCCCGTG CGGTGCCTGG 600 

CAGGGTGGTC GCGCTCGGCC AAACCGTGCA GGCGTCGGAT TCGCTGACCG GTGCCGAAGA 660 

GACATTGAAC GGGTTGATCC AGTTCGATGC CGCAATCCAG CCCGGTGATT CGGGCGGGCC 720 

CGTCGTCAAC GGCCTAGGAC AGGTGGTCGG TATGAACACG GCCGCGTCCG ATAACTTCCA 780 

GCTGTCCCAG GGTGGGCAGG GATTCGCCAT TCCGATCGGG CAGGCGATGG CGATCGCGGG 840 

CCAAATCCGA TCGGGTGGGG GGTCACCCAC CGTTCATATC GGGCCTACCG CCTTCCTCGG 900 

CTTGGGTGTT GTCGACAACA ACGGCAACGG CGCACGAGTC CAACGCGTGG TCGGAAGCGC 960 

TCCGGCGGCA AGTCTCGGCA TCTCCACCGG CGACGTGATC ACCGCGGTCG ACGGCGCTCC 1020 

GATCAACTCG GCCACCGCGA TGGCGGACGC GCTTAACGGG CATCATCCCG GTGACGTCAT 1080 

CTCGGTGAAC TGGCAAACCA AGTCGGGCGG CACGCGTACA GGGAACGTGA CATTGGCCGA 1140 

GGGACCCCCG GCCTGATTTG TCGCGGATAC CACCCGCCGG CCGGCCAATT GGATTGGCGC 1200 

CAGCCGTGAT TGCCGCGTGA GCCCCCGAGT TCCGTCTCCC GTGCGCGTGG CATTGTGGAA 1260 

GCAATGAACG AGGCAGAACA CAGCGTTGAG CACCCTCCCG TGCAGGGCAG TTACGTCGAA 1320 

GGCGGTGTGG TCGAGCATCC GGATGCCAAG GACTTCGGCA GCGCCGCCGC CCTGCCCGCC 1380 

GATCCGACCT GGTTTAAGCA CGCCGTCTTC TACGAGGTGC TGGTCCGGGC GTTCTTCGAC 1440 

GCCAGCGCGG ACGGTTCCGN CGATCTGCGT GGACTCATCG ATCGCCTCGA CTACCTGCAG 1500 

TGGCTTGGCA TCGACTGCAT CTGTTGCCGC CGTTCCTACG ACTCACCGCT GCGCGACGGC 1560 

GGTTACGACA TTCGCGACTT CTACAAGGTG CTGCCCGAAT TCGGCACCGT CGACGATTTC 1620 

GTCGCCCTGG TCGACACCGC TCACCGGCGA GGTATCCGCA TCATCACCGA CCTGGTGATG 1680 

AATCACACCT CGGAGTCGCA CCCCTGGTTT CAGGAGTCCC GCCGCGACCC AGACGGACCG 1740 

TACGGTGACT ATTACGTGTG GAGCGACACC AGCGAGCGCT ACACCGACGC CCGGATCATC 18 00 

TTCGTCGACA CCGAAGAGTC GAACTGGTCA TTCGATCCTG TCCGCCGACA GTTNCTACTG 1860 

GCACCGATTC TT 1872 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IjENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTTCGCCGAA ACCTGATGCC GAGGAACAGG GTGTTCCCGT GAGCCCGACG GCGTCCGACC 60 

CCGCGCTCCT CGCCGAGATC AGGCAGTCGC TTGAT GCGAC AAAAGGGTTG ACCAGCGTGC 120 

ACGTAGCGGT CCGAACAACC GGGAAAGTCG ACAGCTTGCT GGGT ATT AC C AGTGCCGATG 180 

TCGACGTCCG GGCCAATCCG CTCGCGGCAA AGGGCGTATG CACCTACAAC GAC GAGCAGG 240 

GTGTCCCGTT TCGGGTACAA GGCGACAACA TCTCGGTGAA ACTGTTCGAC GACTGGAGCA 300 

ATCTCGGCTC GATTTCTGAA CTGTCAACTT CACGCGTGCT CGATCCTGCC GCTGGGGTGA 360 

CGCAGCTGCT GTCCGGTGTC ACGAACCTCC AAGCGCAAGG TACCGAAGTG ATAGACGGAA 420 

TTTCGACCAC CAAAATCACC GGGACCATCC CCGCGAGCTC TGTCAAGATG CTTGATCCTG 480 

GCGCCAAGAG TGCAAGGCCG GCGACCGTGT GGATTGCCCA GGACGGCTCG CACCACCTCG 540 

TCCGAGCGAG CATCGACCTC GGATCCGGGT CGATTCAGCT CACGCAGTCG AAATGGAACG 600 

AACCCGTCAA CGTCGACTAG GCCGAAGTTG CGTCGACGCG TTGCTCGAAA CGCCCTTGTG 660 

AACGGTGTCA ACGGCACCCG AAAACTGACC CCCTGACGGC ATCTGAAAAT TGACCCCCTA 720 

GACCGGGCGG TTGGTGGTTA TTCTTCGGTG GTTCCGGCTG GTGGGACGCG GCCGAGGTCG 780 

CGGTCTTTGA GCCGGTAGCT GTCGCCTTTG AGGGCGACGA CTTCAGCATG GTGGACGAGG 840 
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CGGTCGATCA TGGCGGCAGC AACGACGTCG TCGCCGCCGA AAACCTCGCC CCACCGGCCG 900 

AAGGCCTTAT TGGACGTGAC GAT C AAGCTG GCCCGCTCAT ACCGGGAGGA CACCAGCTGG 960 

AAGAAGAGGT TGGCGGC CTC GGGCTCAAAC GGAATGTAAC CGACTTCGTC AACCACCAGG 1020 

AGCGGATAGC GGCCAAACCG GGTGAGTTCG GCGTAGATGC GCCCGGCGTG GTGAGCCTCG 1080 

GCGAACCGTG CTACCCATTC GGCGGCGGTG GCGAACAGCA CCCGATGACC GGCCTGACAC 1140 

GCGCGTATCG CCAGGCCGAC CGCAAGATGA GTCTTCCCGG TGCCAGGCGG GGCCCAAAAA 1200 

CACGACGTTA TCGCGGGCGG TGATGAAATC CAGGGTGCCC AGATGTGCGA TGGTGTCGCG 1260 

TTTGAGGCCA CGAGCATGCT CAAAGTCGAA CTCTTCCAAC GACTTCCGAA CCGGGAAGCG 1320 

GGCGGCGCGG ATGCGGCCCT CACCACCATG GGACTC CCGG GCTGACACTT CCCGCTGCAG 1380 

GCAGGCGGCC AGGTATTCTT CGTGGCTCCA GTTCTCGGCG CGGGCGCGAT CGGCCAGCCG 1440 

GGACACTGAC TCACGCAGGG TGGGAGCTTT CAATGCTCTT GT 1482 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GAATTCGGCA CGAGCCGGCG ATAGCTTCTG GGCCGCGGCC GACCAGATGG CTCGAGGGTT 60 

CGTGCT CGGG GCCACCGCCG GGCGCACCAC CCTGACCGGT GAGGGCCTGC AACACGCCGA 120 

CGGTCACTCG TTGCTGCTGG ACGCCACCAA CCCGGCGGTG GTTGCCTACG ACCCGGCCTT 180 

CGCCTACGAA ATCGGCTACA TCGNGGAAAG CGGACTGGCC AGGATGTGCG GGGAGAACCC 240 

GGAGAACATC TTCTTCTACA TCACCGTCTA CAACGAGCCG TACGTGCAGC CGCCGGAGCC 300 

GGAGAACTTC GATCCCGAGG GCGTGCTGGG GGGTATCTAC CGNTATCACG CGGCCACCGA 360 

GCAACGCACC AACAAGGNGC AGATCCTGGC CTCCGGGGTA GCGATGCCCG CGGCGCTGCG 42 0 

GGCAGCACAG ATGCTGGCCG CCGAGTGGGA TGTCGCCGCC GACGTGTGGT CGGTGACCAG 480 

TTGGGGCGAG CTAAACCGCG ACGGGGTGGT CATCGAGACC GAGAAGCTCC GCCACCCCGA 540 

TCGGCCGGCG GGCGTGCCCT ACGTGACGAG AGCGCTGGAG AATGCTCGGG GCCCGGTGAT 600 

CGCGGTGTCG GACTGGATGC GCGCGGTCCC CGAGCAGATC CGACCGTGGG TGC CGGGCAC 660 

ATACCTCACG TTGGGCACCG ACGGGTTCGG TTTTTCCGAC ACTCGGCCCG CCGGTCGTCG 720 

TTACTTCAAC ACCGACGCCG AATCCCAGGT TGGTCGCGGT TTTGGGAGGG GTTGGCCGGG 780 

TCGACGGGTG AATATCGACC CATTCGGTGC CGGTCGTGGG CCGCCCGCCC AGTTACCCGG 840 

ATTCGACGAA GGTGGGGGGT TGCGCCCGAN TAAGTT 876 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



ATCCCCCCGG GCTG CAGGAA TTCGGCACGA GAGACAAAAT TCCACGCGTT AATGCAGGAA 60 

CAGATTCATA ACGAATTCAC AGCGGCACAA CAATATGTCG CGATCGCGGT TTATTTCGAC 120 

AGCGAAGACC TGCCGCAGTT GGCGAAGCAT TTTTACAGCC AAGCGGTCGA GGAACGAAAC 180 

CATGCAATGA TGCTCGTGCA ACACCTGCTC GACCGCGACC TTCGTGTCGA AATTCCCGGC 240 

GTAGACACGG TGCGAAACCA GTTCGACAGA CCCCGCGAGG CACTGGCGCT GGCGCTCGAT 3 00 

CAGGAACGCA CAGTCACCGA CCAGGTCGGT CGGCTGACAG CGGTGGCCCG CGACGAGGGC 360 

GATTTCCTCG GCGAGCAGTT CATGCAGTGG TTCTTGCAGG AACAGATCGA AGAGGTGGCC 420 
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TTGATGGCAA CCCTGGTGCG GGTTGCCGAT CGGGCCGGGG CCAACCTGTT CGAGCTAGAG 48 0 

AACTTCGTCG CACGTGAAGT GGATGTGGCG CCGGCCGCAT CAGGCGCCCC GCACGCTGCC 540 

GGGGGCCGCC TCTAGATCCC TGGGGGGGAT CAGCGAGTGG TCCCGTTCGC CCGCCCGTCT 600 

TCCAGCCAGG CCTTGGTGCG GCCGGGGTGG TGAGTACCAA TCCAGGCCAC CCCGACCTCC 66 0 

CGGNAAAAGT CGATGTCCTC GTACTCATCG ACGTTCCAGG AGTACACCGC CCGGCCCTGA 720 

GCTGCCGAGC GGTCAACGAG TTGCGGATAT TCCTTTAACG CAGGCAGTGA GGGTCCCACG 780 

GCGGTTGGCC CGACCGCCGT GGCCGCACTG CTGGTCAGGT AT CGGGGGGT CTTGGCGAGC 84 0 

AACAACGTCG GCAGGAGGGG TGGAGCCCGC CGGATCCGCA GACCGGGGGG GCGAAAACGA 900 

CATCAACACC GCACGGGATC GATCTGCGGA GGGGGGTGCG GGAATACCGA AC CGGTGTAG 960 

GAGCGCCAGC AGTTGTTTTT CCACCAGCGA AGCGTTTTCG GGTCATCGGN GGCNNTTAAG 1020 

T 1021 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO :21: 

CGTGC CGACG AACGGAAGAA CACAACCATG AAGATGGTGA AATCGATCGC CGCAGGTCTG 6 0 

ACCGCCGCGG CTGCAATCGG CGCCGCTGCG GCCGGTGTGA CTTCGATCAT GGCTGGCGGN 120 

CCGGTCGTAT ACCAGATGCA GCCGGTCGTC TTCGGCGCGC CACTGCCGTT GGACCCGGNA 18 0 

TCCGCCCCTG ANGTCCCGAC CGCCGCCCAG TGGACCAGNC TGCTCAACAG NCTCGNCGAT 240 

CCCAACGTGT CGTTTGNGAA CAAGGGNAGT CTGGTCGAGG GNGGNATCGG NGGNANCGAG 300 

GGNGNGNATC GNCGANCACA A 321 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TCTTATCGGT TCCGGTTGGC GACGGGTTTT GGGNGCGGGT GGTTAACCCG CTCGGCCAGC 60 

CGATCGACGG GCGCGGAGAC GTCGACTCCG ATACTCGGCG CGCGCTGGAG CTCCAGGCGC 120 

CCTCGGTGGT GNACCGGCAA GGCGTGAAGG AGC CGTTGNA GACCGGGATC AAGGCGATTG 180 

ACGCGATGAC CCCGATCGGC CGCGGGCAGC GCCAGCTGAT CATCGGGGAC CGCAAGACCG 24 0 

GCAAAAACCG CCGTCTGTGT CGGACACCAT CCTCAAACCA GCGGGAAGAA CTGGGAGTCC 300 

GGTGGATCCC AAGAAGCAGG TGCGCTTGTG TATACGTTGG CCATCGGGCA AGAAGGGGAA 360 

CTTACCATCG CCG 373 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GTGACGCCGT GATGGGATTC CTGGGCGGGG CCGGTCCGCT GGCGGTGGTG GATCAGCAAC 60 

TGGTTACCCG GGTGCCGCAA GGCTGGTCGT TTGCTCAGGC AGCCGCTGTG CCGGTGGTGT 12 0 

TCTTGACGGC CTGGTACGGG TTGGCCGATT TAGCCGAGAT CAAGGCGGGC GAATCGGTGC 18 0 

TGATCCATGC CGGTACCGGC GGTGTGGGCA TGGCGGCTGT GCAGCTGGQT CGCCAGTGGG 240 

GCGTGGAGGT TTTCGTCACC GCCAGCCGTG GNAAGTGGGA CACGCTGCGC GCCATNGNGT 300 

TTGACGACGA NCCATATCGG NGATTCCCNC ACATNCGAAG TTCCGANGGA GA 352 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GAAATCCGCG TTCATTCCGT TCGACCAGCG GCTGGCGATA ATCGACGAAG TGATCAAGCC 60 

GCGGTTCGCG GCGCTCATGG GTCACAGCGA GTAATCAGCA AGTTCT CTGG TATATCGCAC 120 

CTAGCGTCCA GTTGCTTGCC AGATCGCTTT CGTACCGTCA TCGCATGTAC CGGTTCGCGT 180 

GCCGCACGCT CATGCTGGCG GCGTGCATCC TGGCCACGGG TGTGGCGGGT CTCGGGGTCG 240 

GCGCGCAGTC CGCAGCCCAA ACCGCGCCGG TGCCCGACTA CTACTGGTGC CCGGGGCAGC 300 

CTTTCGACCC CGCATGGGGG CCCAACTGGG ATCCCTACAC CTGCCATGAC GACTTCCACC 36 0 

GCGACAGCGA CGGCCCCGAC CACAGCCGCG ACTACCCCGG ACCCATCCTC GAAGGTCCCG 42 0 

TGCTTGACGA TCCCGGTGCT GCGCCGCCGC CCCCGGCTGC CGGTGGCGGC GCATAGCGCT 48 0 

CGTTGAC CGG GCCGCATCAG CGAATACGCG TATAAACCCG GGCGTGCCCC CGGCAAGCTA 540 

CGACCCCCGG CGGGGCAGAT TTACGCTCCC GTGC CGATGG ATCGCGCCGT CCGATGACAG 600 

AAAATAGGCG ACGGTTTTGG CAACCGCTTG GAGGACGCTT GAAGGGAACC TGTCATGAAC 660 

GGCGACAGCG CCTCCACCAT CGACATCGAC AAGGTTGTTA CCCGCACACC CGTTCGCCGG 720 

ATCGTG 726 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CGCGACGACG ACGAACGTCG GGCCCACCAC CGCCTATGCG TTGATGCAGG CGACCGGGAT 6 0 

GGTCGCCGAC CATATCCAAG CATGCTGGGT GCCCACTGAG CGAC CTTTTG ACCAGCCGGG 12 0 

CTGCCCGATG GCGGCCCGGT GAAGTCATTG CGCCGGGGCT TGTGCACCTG ATGAACCCGA 18 0 

ATAGGGAACA ATAGGGGGGT GATTTGGCAG TTCAATGTCG GGTATGGCTG GAAATCCAAT 240 

GGCGGGGCAT GCTCGGCGCC GACCAGGCTC GCGCAGGCGG GCCAGCCCGA ATCTGGAGGG 30 0 

AGCACTCAAT GGCGGCGATG AAGCCCCGGA CCGGCGACGG TCCTTTGGAA GCAACTAAGG 360 

AGGGGCGCGG CATTGTGATG CGAGTACCAC TTGAGGGTGG CGGTCGCCTG GTCGTCGAGC 420 

TGACACCCGA CGAAGCCGCC GCACTGGGTG ACGAACTCAA AGGCGTTACT AGCTAAGACC 480 

AGCCCAACGG CGAATGGTCG GCGTTACGCG CACACCTTCC GGTAGATGTC CAGTGTCTGC 540 

TCGGCGATGT ATGCCCAGGA GAACTCTTGG ATACAGCGCT 580 



(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

AACGGAGGCG C CGGGGGTTT TGGCGGGGCC GGGGCGGTCG GCGGCAACGG CGGGGCCGGC 60 
GGTACCGCCG GGTTGTTCGG TGTCGGCGGG GCCGGTGGGG CCGGAGGCAA CGGCATCGCC 120 
GGTGTCACGG GTACGTCGGC CAGCACACCG GGTGGATCCG 160 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GACACCGATA CGATGGTGAT GTACGC CAAC GTTGTCGACA CGCTCGAGGC GTTCACGATC 60 

CAGCGCACAC CCGACGGCGT GACCATCGGC GATGCGGCCC CGTTCGCGGA GGCGGCTGCC 120 

AAGGCGATGG GAATCGACAA GCTGCGGGTA ATTCATACCG GAATGGACCC CGTCGTCGCT 18 0 

GAACGCGAAC AGTGGGACGA CGGCAACAAC ACGTTGGCGT TGGCGCCCGG TGTCGTTGTC 24 0 

GCCTACGAGC GCAACGTACA GACCAACGCC CG 272 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.28: 

GCAGCCGGTG GTTCTCGGAC TATCTGCGCA CGGTGACGCA GCGCGACGTG CGCGAGCTGA 60 

AGCGGATCGA GCAGACGGAT CGCCTGCCGC GGTTCATGCG CTACCTGGCC GCTATCACCG 120 

CGCAGGAGCT GAACGTGGCC GAAGCGGCGC GGGTCATCGG GGTCGACGCG GGGACGATCC 180 

GTTCGGATCT GGCGTGGTTC GAGACGGTCT ATCTGGTACA TCGCCTGCCC GCCTGGTCGC 240 

GGAATCTGAC CGCGAAGATC AAGAAGCGGT CAAAGATCCA CGTCGTCGAC AGTGGCTTCG 3 00 

CGGCCTGGTT GCGCGGG 317 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
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GATCGTGGAG CTGTCGATGA ACAGCGTTGC CGGACGCGCG GCGGCCAGCA CGTCGGTGTA 60 

GCAGCGCCGG ACCACCTCGC CGGTGGGCAG CATGGTGATG ACCACGTCGG CCTCGGCCAC 120 

CGCTTCGGGC GCGCTACGAA ACACCGCGAC ACCGTGCGCG GCGGCGCCGG ACGCCGCCGT 180 

GG 182 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



GATCGCGAAG TTTGGTGAGC AGGTGGTCGA CGCGAAAGTC TGGGCGCCTG CGAAGCGGGT 60 

CGGCGTTCAC GAGGCGAAGA CACGCCTGTC CGAGCTGCTG CGGCTCGTCT ACGGCGGGCA 120 

GAGGTTGAGA TTGCCCGCCG CGGCGAGCCG GTAGCAAAGC TTGTGCCGCT GCATCCTCAT 180 

GAGACTCGGC GGTTAGGCAT TGACCATGGC GTGTACCGCG TGCCCGACGA TTTGGACGCT 240 

CCGTTGTCAG ACGACGTGCT CGAACGCTTT CACCGGTGAA GCGCTACCTC ATCGACACCC 3 00 

ACGTTTGG 308 



INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CCGACGACGA GCAACTCACG TGGATGATGG TCGGCAGCGG CATTGAGGAC GGAGAGAATC 60 

CGGCCGAAGC TGCCGCGCGG CAAGTGCTCA TAGTGACCGG CCGTAGAGGG CTCCCCCGAT 120 

GGCACCGGAC TATTCTGGTG TGCCGCTGGC CGGTAAGAGC GGGTAAAAGA ATGTGAGGGG 180 

ACACGATGAG CAATCACACC TACCGAGTGA TCGAGATCGT CGGGACCTCG CCCGACGGCG 240 

TCGACGCGGC AATCCAGGGC GGTCTGG 267 



<2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



CTCGTGCCGA AAGAATGTGA GGGGACACGA TGAGCAATCA CACCTACCGA GTGATCGAGA 60 

TCGTCGGGAC CTCGCCCGAC GGCGTCGACG CGGCAATCCA GGGCGGTCTG GCCCGAGCTG 120 

CGCAGACCAT GCGCGCGCTG GACTGGTTCG AAGTACAGTC AATTCGAGGC CACCTGGTCG 180 

ACGGAGCGGT CGCGCACTTC CAGGTGACTA TGAAAGTCGG CTTCCGCTGG AGGATTCCTG 240 

AACCTTCAAG CGCGGCCGAT AACTGAGGTG CATCATTAAG CGACTTTTCC AGAACATCCT 300 

GACGCGCTCG AAACGCGGTT CAGCCGACGG TGGCTCCGCC GAGGCGCTGC CTCCAAAATC 360 

CCTGCGACAA TTCGTCGGCG GCGCCTACAA GGAAGTCGGT GCTGAATTCG TCGGGTATCT 420 
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GGTCGACCTG TGTGGGCTGC AGCCGGACGA AGCGGTGCTC GACGTCGGCT GCGGCTCGGG 480 

GCGGATGGCG TTGCCGCTCA CCGGCTATCT GAACAGCGAG GGACGCTACG CCGGCTTCGA 540 

TATCTCGCAG AAAGCCATCG CGTGGTGCCA GGAGCACATC ACCTCGGCGC ACCCCAACTT 600 

CCAGTTCGAG GTCTCCGACA TCTACAACTC GCTGTACAAC CCGAAAGGGA AATACCAGTC 660 

ACTAGACTTT CGCTTTCCAT ATCCGGATGC GTCGTTCGAT GTGGTGTTTC TTACCTCGGT 720 

GTTCACCCAC ATGTTTCCGC CGGACGTGGA GCACTATCTG GACGAGATCT CCCGCGTGCT 780 

GAAGCCCGGC GGACGATGCC TGTGCACGTA CTTCTTGCTC AATGACGAGT CGTTAGC CCA 840 

CATCGCGGAA GGAAAGAGTG CGCACAACTT CCAGCATGAG GGACCGGGTT AT CGGAC AAT 900 

CCACAAGAAG CGGCCCGAAG AAGCAATCGG CTTGC CGGAG ACCTTCGTCA GGGATGTCTA 960 

TGGCAAGTTC GGCCTCGCCG TGCACGAACC ATTGCACTAC GGCTCATGGA GTGGCCGGGA 102 0 

ACCACGCCTA AGCTTCCAGG ACATCGTCAT CGCGACCAAA ACCGCGAGCT AGGTCGGCAT 1080 

CCGGGAAGCA TCGCGACACC GTGGCGC CGA GCGCCGCTGC CGGCAGGCCG ATTAGGCGGG 1140 

CAGATTAGCC CGCCGCGGCT CCCGGCTCCG AGTACGGCGC CCCGAATGGC GTCACCGGCT 1200 

GGTAACCACG CTTGCGCGCC TGGGCGGCGG CCTGC CGGAT CAGGTGGTAG ATGCCGACAA 1260 

AGCCTGCGTG ATCGGTCATC ACCAACGGTG ACAGCAGCCG GTTGTGCACC AGCGCGAACG 1320 

CCACCCCGGT CTCCGGGTCT GTCCAGCCGA TCGAGCCGCC CAAGCCCACA TGACCAAACC 1380 

CCGGCATCAC GTTGCCGATC GGCATACCGT GATAGCCAAG ATGAAAATTT AAGGGCACCA 1440 

ATAGATTTCG ATCCGGCAGA ACTTGCCGTC GGTTGCGGGT CAGGCCCGTG ACCAGCTCCC 1500 

GCGACAAGAA CCGTATGCCG TCGATCTCGC CTCGTGCCG 1539 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CTGCAGGGTG GCGTGGATGA G.CGTCACCGC GGGGCAGGCC GAGCTGACCG CCGCCCAGGT 60 

CCGGGTTGCT GCGGCGGCCT ACGAGACGGC GTATGGGCTG ACGGTGCCCC CGCCGGTGAT 120 

CGCCGAGAAC CGTGCTGAAC TGATGATTCT GATAGCGACC AACCTCTTGG GGCAAAACAC 180 

CCCGGCGATC GCGGTCAACG AGGCCGAATA CGGCGAGATG TGGGCCCAAG ACGCCGCCGC 240 

GATGTTTGGC TACGCCGCGG CGACGGCGAC GGCGACGGCG ACGTTGCTGC CGTTCGAGGA 300 

GGCGC CGGAG ATGACCAGCG CGGGTGGGCT CCTCGAGCAG GCCGCCGCGG TCGAGGAGGC 360 

CTCCGACACC GCCGCGGCGA ACCAGTTGAT GAACAATGTG CCCCAGGCGC TGAAACAGTT 420 

GGCCCAGCCC ACGCAGGGCA CCACGCCTTC TTCCAAGCTG GGTGGCCTGT GGAAGACGGT 480 

CTCGCCGCAT CGGTCGCCGA TCAGCAACAT GGTGTCGATG GCCAACAACC ACATGTCGAT 540 

GACCAACTCG GGTGTGTCGA TGACCAACAC CTTGAGCTCG ATGTTGAAGG GCTTTGCTCC 600 

GGCGGCGGCC GCCCAGGCCG TGCAAACCGC GGCGCAAAAC GGGGTCCGGG CGATGAG CTC 660 

GCTGGGCAGC TCGCTGGGTT CTTCGGGTCT GGGCGGTGGG GTGGCCGCCA ACTTGGGTCG 720 

GGCGGCCTCG GTACGGTATG GTCACCGGGA TGGCGGAAAA TATGCANAGT CTGGTCGGCG 780- 

GAACGGTGGT CCGGCGTAAG GTTTACCCCC GTTTTCTGGA TGCGGTGAAC TTCGTCAACG 840 

GAAACAGTTA C 851 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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GATCGATCGG GCGGAAATTT GGACCAGATT CGCCTCCGGC GATAACCCAA TCAATCGAAC 60 

CTAGATTTAT TCCGTCCAGG GGCCCGAGTA ATGGCTCGCA GGAGAGGAAC CTTACTGCTG 12 0 

CGGGCACCTG TCGTAGGTCC TCGATACGGC GGAAGGCGTC GACATTTTCC ACCGACACCC 180 

CCATCCAAAC GTTCGAGGGC CACTCCAGCT TGTGAGCGAG GCGACGCAGT CGCAGGCTGC 240 

GCTTGGTCAA GATC " 254 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GATCCTGACC GAAGCGGCCG CCGCCAAGGC GAAGT CGCTG TTGGACCAGG AGGGACGGGA 60 

CGATCTGGCG CTGCGGATCG CGGTTCAGCC GGGGGGGTGC GCTGGATTGC GCTATAACCT 120 

TTTCTTCGAC GACCGGACGC TGGATGGTGA CCAAACCGCG GAGTTCGGTG GTGTCAGGTT 180 

GATCGTGGAC CGGATGAGCG CGCCGTATGT GGAAGGCGCG TCGATCGATT TCGTCGACAC 240 

TATTGAGAAG CAAGGTTCAC CATCGACAAT CCCAACGCCA CCGGCTCCTG CGCGTGCGGG 3 00 

GATTCGTTCA ACTGATAAAA CGCTAGTACG ACCCCGCGGT GCGCAACACG TACGAGCACA 360 

CCAAGACCTG ACCGCGCTGG AAAAGCAACT GAGCGATGCC TTGCACCTGA CCGCGTGGCG 420 

GGCCGCCGGC GGCAGGTGTC ACCTGCATGG TGAACAGCAC CTGGGC CTGA TATTGCGACC 480 

AGTACACGAT TTTGTCGATC GAGGTCACTT CGACCTGGGA GAACTGCTTG CGGAACGCGT 540 

CGCTGCTCAG CTTGGC CAAG GCCTGATCGG AGCGCTTGTC GCGCACGCCG TCGTGGATAC 600 

CGCACAGCGC ATTGCGAACG ATGGTGTCCA CATCGCGGTT CTCCAGCGCG TTGAGGTATC 660 

CCTGAATCGC GGTTTTGGCC GGTCCCTCCG AGAATGTGCC TGCCGTGTTG GCT CCGTTGG 720 

TGCGGACCCC GTATATGATC GCCGCCGTCA TAGCCGACAC CAGCGCGAGG GCTACCACAA 780 

TGCCGATCAG CAGCCGCTTG TGCCGTCGCT TCGGGTAGGA CACCTGCGGC GGCACGCCGG 840 

GATATGCGGC GGGCGGCAGC GCCGCGTCGT CTGCCGGTCC CGGGGCGAAG GCCGGTTCGG 900 

CGGCGCCGAG GTCGTGGGGG TAGTCCAGGG CTTGGGGTTC GTGGGATGAG GGCTCGGGGT 960 

ACGGCGCCGG TCCGTTGGTG CCGACACCGG GGTTCGGCGA GTGGGGACCG GGCATTGTGG 1020 

TTCTCCTAGG GTGGTGGACG GGACCAGCTG CTAGGGCGAC AACCGCCCGT CGCGTCAGCC 1080 

GGCAGCATCG GCAATCAGGT GAGCTCCCTA GGCAGGCTAG CGCAACAGCT GCCGTCAGCT 1140 

CTCAACGCGA CGGGGCGGGC CGCGGCGCCG ATAATGTTGA AAGACTAGGC AACCTTAGGA 1200 

ACGAAGGACG GAGATTTTGT GACGATC 1227 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGGGCCGGC GGGGCCGGCG 60 

GGACCGGCGC TAACGGTGGT GCCGGCGGCA ACGCCTGGTT GTTCGGGGCC GGCGGGTCCG 120 

GCGGNGCCGG CACCAATGGT GGNGTCGGCG GGTCCGGCGG ATTTGT CTAC GGCAACGGCG 180 

G 181 



(2) INFORMATION FOR SEQ ID NO: 37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGTGTCGGC GGCCGGGGCG 60 

GCGACGGCGT CTTTGCCGGT GCCGGCGGCC AGGGCGGCCT CGGTGGGCAG GGCGGCAATG 120 

GCGGCGGCTC CACCGGCGGC AACGGCGGTC TTGGCGGCGC GGGCGGTGGC GGAGGCAACG 180 

CCCCGGACGG CGGCTT CGGT GGCAACGGCG GTAAGGGTGG CCAGGG CGGN ATTGGCGGCG 240 

GCACTCAGAG CGCGACCGGC CTCGGNGGTG ACGGCGGTGA CGGCGGTGAC 290 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GATCCAGTGG CATGGNGGGT GTCAGTGGAA GCAT 34 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GATCGCTGCT CGTCCCCCCC TTGCCGCCGA CGCCACCGGT CCCACCGTTA CCGAACAAGC 60 
TGGCGTGGTC GCCAGCACCC CCGGCACCGC CGACGC CGGA GTCGAACAAT GGCACCGTCG 120 
TATCCCCACC ATTGCCGCCG GNCCCACCGG CACCG 155 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATGGCGTTCA CGGGGCGCCG GGGACCGGGC AGCCCGGNGG GGCCGGGGGG TGG 53 



(2) INFORMATION FOR SEQ ID NO: 41: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

GATCCACCGC GGGTGCAGAC GGTGCCCGCG GCGCCACCCC GACCAGCGGC GGCAACGGCG 60 
GCACCGGCGG CAACGGCGCG AACGCCACCG TCGTCGGNGG GGCCGGCGGG GCCGGCGGCA 120 
AGGGCGGCAA CG 132 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GATCGGCGGC CGGNAC GGNC GGGGACGGCG GCAAGGGCGG NAACGGGGGC GCCGNAGCCA 60 
CCNGCCAAGA ATCCTCCGNG TCCNCCAATG GCGCGAATGG CGGACAGGGC GGCAACGGCG 12 0 

GCANCGGCGG CA 132 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60 

CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 120 

ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 180 

AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG 240 

AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 300 

CCATCACACC GTGCGAACTC ACGGNGGNTA AAAACGCCGC CCAACAGNTG GTNTTGTCCG 360 

CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 420 

CGCTGCGCAA CGCGGCCAAG GNGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 480 

ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGC CGTCGGA GGGGACAGTT 540 

CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 600 

TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTGNG 660 

GGGATGGGTG GAACACTTNC ACCCTGACGC TGCAAGGCGA CG 702 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAAGCCGCAG CGCTGTCGGG CGACGTGGCG GTCAAAGCGG CATCGCTCGG TGGCGGTGGA 60 

GGCGGCGGGG TGCCGTCGGC GCCGTTGGGA TCCGCGATCG GGGGCGCCGA ATCGGTGCGG 12 0 

CCCGCTGGCG CTGGTGACAT TGCCGGCTTA GGCCAGGGAA GGGCCGGCGG CGGCGCCGCG 180 

CTGGGCGGCG GTGGCATGGG AATGCCGATG GGTGCCGCGC ATCAGGGACA AGGGGGCGCC 240 

AAGTCCAAGG GTTCTCAGCA GGAAGACGAG GCGCTCTACA CCGAGGATCC TCGTGCCG 298 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CGGCACGAGG ATCGAATCGC GTC GCCGGGA GCACAGCGTC GCACTGCACC AGTGGAGGAG 60 

CCATGACCTA CTCGCCGGGT AACCCCGGAT ACCCGCAAGC GCAGCCCGCA GGCTCCTACG 120 

GAGGCGTCAC ACCCTCGTTC GCCCACGCCG ATGAGGGTGC GAGCAAGCTA CCGATGTACC 180 

TGAACATCGC GGTGGCAGTG CTCGGTCTGG CTGCGTACTT CGCCAGCTTC GGCC CAATGT 240 

TCACCCTCAG TACCGAACTC GGGGGGGGTG ATGGCGCAGT GTC CGGTGAC ACTGGGCTGC 300 

CGGTCGGGGT GGCTCTGCTG GCTGCGCTGC TTGCCGGGGT GGTTCTGGTG CCTAAGGCCA 360 

AGAGCCATGT GACGGTAGTT GCGGTGCTCG GGGTACTCGG CGTATTTCTG ATGGTCTCGG 420 

CGACGTTTAA CAAGCCCAGC GCCTATTCGA CCGGTTGGGC ATTGTGGGTT GTGTTGGCTT 480 

TCATCGTGTT CCAGGCGGTT GCGGCAGTCC TGGCGCTCTT GGTGGAGACC GGCGCTATCA 540 

CCGCGCCGGC GCCGCGGCCC AAGTTCGACC CGTATGGACA GTACGGGCGG TACGGGCAGT 600 

ACGGGCAGTA CGGGGTGCAG CCGGGTGGGT ACTACGGTCA GCAGGGTGCT CAGCAGGCCG 660 

CGGGACTGCA GTCGCCCGGC CCGCAGCAGT CTCCGCAGCC TCCCGGATAT GGGTCGCAGT 720 

ACGGCGGCTA TTCGTCCAGT CCGAGCCAAT CGGGCAGTGG ATACACTGCT CAGCCCCCGG 780 

CCCAGCCGCC GGCGCAGTCC GGGTCGCAAC AATCGCACCA GGGCCCATCC ACGCCACCTA 840 

CCGGCTTTCC GAGCTTCAGC CCACCACCAC CGGTCAGTGC CGGGACGGGG TCGCAGGCTG 900 

GTTCGGCTCC AGTCAACTAT TCAAACCCCA GCGGGGGCGA GCAGTCGTCG TCCCCCGGGG 960 

GGGCGCCGGT CTAACCGGGC GTTCCCGCGT CCGGTCGCGC GTGTGCGCGA AGAGTGAACA 1020 

GGGTGTCAGC AAGCGCGGAC GATCCTCGTG CCGAATTC 1058 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CGGCACGAGA GACCGATGCC GCTACCCTCG CGCAGGAGGC AGGTAATTTC GAGCGGATCT 60 

CCGGCGACCT GAAAACCCAG ATCGACCAGG TGGAGTCGAC GGCAGGTTCG TTGCAGGGCC 120 

AGTGGCGCGG GGCGGCGGGG ACGGCCGCCC AGGC CGCGGT GGTGCGCTTC CAAGAAGCAG 180 

CCAATAAGCA GAAGCAGGAA CTCGACGAGA TCTCGACGAA TATTCGTCAG GCCGGCGTCC 240 

AATACTCGAG GGCCGACGAG GAGCAGCAGC AGGCGCTGTC CTCGCAAATG GGCTTCTGAC 300 



CCGCTAATAC GAAAAGAAAC GGAGCAA 



327 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CGGTCGCGAT GATGGCGTTG TCGAACGTGA CCGATTCTGT ACCGCCGTCG TTGAGATCAA 60 
CCAACAACGT GTTGGCGTCG GCAAATGTGC CGNACCCGTG GATCTCGGTG ATCTTGTTCT 120 
TCTTCATCAG GAAGTGCACA CCGGCCACCC TGCCCTCGGN TACCTTTCGG 170 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

GATCCGGCGG CACGGGGGGT GCCGGCGGCA GCACCGCTGG CGCTGGCGGC AACGGCGGGG 60 
CCGGGGGTGG CGGCGGAACC GGTGGGTTGC TCTTCGGCAA CGGCGGTGCC GGCGGGCACG 120 
GGGCCGT I 27 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CGGCGGCAAG GGCGGCACCG CCGGCAACGG GAGCGGCGCG GCCGGCGGCA ACGGCGGCAA 60 
CGGCGGCTCC GGCCTCAACG G 81 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GATCAGGGCT GGCCGGCTCC GGCCAGAAGG GCGGTAACGG AGGAGCTGCC GGATTGTTTG 60 
GCAACGGCGG GGCCGGNGGT GCCGGCGCGT CCAACCAAGC CGGTAACGGC GGNGCCGGCG 120 
GAAACGGTGG TGCCGGTGGG CTGATCTGG 149 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGCACGAGA TCACACCTAC CGAGTGATCG AGATCGTCGG GACCTCGCCC GACGGTGTCG 60 

ACGCGGNAAT CCAGGGCGGT CTGGCCCGAG CTGCGCAGAC CATGCGCGCG CTGGACTGGT 120 

TCGAAGTACA GTCAATTCGA GGCCACCTGG TCGACGGAGC GGTCGCGCAC TTCCAGGTGA 180 

CTATGAAAGT CGGCTTCCGC CTGGAGGATT CCTGAACCTT CAAGCGCGGC CGATAACTGA 240 

GGTGCATCAT TAAGCGACTT TTCCAGAACA TCCTGACGCG CTCGAAACGC GGTTCAGCCG 300 

ACGGTGGCTC CGCCGAGGCG CTGCCTCCAA AATCCCTGCG ACAATTCGTC GGCGG 355 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

ATGCATCACC ATCACCATCA CATGCATCAG GTGGACCCCA ACTTGACACG TCGCAAGGGA 60 

CGATTGGCGG CACTGGCTAT CGCGGCGATG GCCAGCGCCA GCCTGGTGAC CGTTGCGGTG 120 

CCCGCGACCG CCAACGCCGA TCCGGAGCCA GCGCCCCCGG TACCCACAAC GGCCGCCTCG 180 

CCGCCGTCGA CCGCTGCAGC GCCACCCGCA CCGGCGACAC CTGTTGCCCC CCCACCACCG 240 

GCCGCCGCCA ACACGCCGAA TGCCCAGCCG GGCGATCCCA ACGCAGCACC TCCGCCGGCC 300 

GACCCGAACG CACCGCCGCC ACCTGTCATT GCCCCAAACG CACCCCAACC TGTCCGGATC 360 

GACAACCCGG TTGGAGGATT CAGCTTCGCG CTGCCTGCTG GCTGGGTGGA GTCTGACGCC 420 

GCCCACTTCG ACTACGGTTC AGCACTCCTC AGCAAAACCA CCGGGGACCC GCCATTTCCC 480 

GGACAGCCGC CGCCGGTGGC CAATGACACC CGTATCGTGC TCGGCCGGCT AGACCAAAAG 540 

CTTTACGCCA GCGCCGAAGC CACCGACTCC AAGGCCGCGG CCCGGTTGGG CTCGGACATG 600 

GGTGAGTTCT ATATGCCCTA CCCGGGCACC CGGATCAACC AGGAAACCGT CTCGCTCGAC 660 

GCCAACGGGG TGTCTGGAAG CGCGTCGTAT TACGAAGTCA AGTTCAGCGA TCCGAGTAAG 72 0 

CCGAACGGCC AGATCTGGAC GGGCGTAATC GGCTCGCCCG CGGCGAACGC ACCGGACGCC 780 

GGGCCCCCTC AGCGCTGGTT TGTGGTATGG CTCGGGACCG CCAACAACCC GGTGGACAAG 840 

GGCGCGGCCA AGGCGCTGGC CGAATCGATC CGGCCTTTGG TCGCCCCGCC GCCGGCGCCG 900 

GCACCGGCTC CTGCAGAGCC CGCTCCGGCG CCGGCGCCGG CCGGGGAAGT CGCTCCTACC 960 

CCGACGACAC CGACACCGCA GCGGACCTTA CCGGCCTGA 999 

(2) INFORMATION FOR SEQ ID NO : 53 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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Met His His His His His His Met His Gin Val Asp Pro Asn Leu Thr 

15 10 15 

Arg Arg Lys Gly Arg Leu Ala Ala Leu Ala lie Ala Ala Met Ala Ser 

20 25 30 

Ala Ser Leu Val Thr Val Ala Val Pro Ala Thr Ala -Asn Ala Asp Pro 

35 40 45 

Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro Ser Thr 

50 55 60 

Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro Val Ala Pro Pro Pro Pro 
65 70 75 80 

Ala Ala Ala Asn Thr Pro Asn Ala Gin Pro Gly Asp Pro Asn Ala Ala 

85 90 95 

Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro Pro Pro Val lie Ala Pro 

100 105 110 

Asn Ala Pro Gin Pro Val Arg lie Asp Asn Pro Val Gly Gly Phe Ser 

115 120 125 

Phe Ala Leu Pro Ala Gly Trp Val Glu Ser Asp Ala Ala His Phe Asp 

130 135 140 

Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr Gly Asp Pro Pro Phe Pro 
145 150 155 160 

Gly Gin Pro Pro Pro Val Ala Asn Asp Thr Arg lie Val Leu Gly Arg 

165 170 175 

Leu Asp Gin Lys Leu Tyr Ala Ser Ala Glu Ala Thr Asp Ser Lys Ala 

180 185 190 

Ala Ala Arg Leu Gly Ser Asp Met Gly Glu Phe Tyr Met Pro Tyr Pro 

195 200 205 

Gly Thr Arg He Asn Gin Glu Thr Val Ser Leu Asp Ala Asn Gly Val 

210 215 220 

Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys 
225 230 235 240 

Pro Asn Gly Gin He Trp Thr Gly Val He Gly Ser Pro Ala Ala Asn 

245 250 255 

Ala Pro Asp Ala Gly Pro Pro Gin Arg Trp Phe Val Val Trp Leu Gly 

260 265 270 

Thr Ala Asn Asn Pro Val Asp Lys Gly Ala Ala Lys Ala Leu Ala Glu 

275 280 285 

Ser lie Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro 

290 295 300 

Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly Glu Val Ala Pro Thr 
305 310 315 320 

Pro Thr Thr Pro Thr Pro Gin Arg Thr Leu Pro Ala 
325 330 



INFORMATION FOR SEQ ID NO: 54: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



Asp Pro Val Asp Ala Val He Asn Thr Thr Xaa Asn Tyr Gly Gin Val 
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15 10 15 

Val Ala Ala Leu 
20 

INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser 
IS 10 15 

INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys 
15 10 15 

Glu Gly Arg 

INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Tyr Tyr Trp Cys Pro Gly Gin Pro Phe Asp Pro Ala Trp Gly Pro 
1 " 5 10 15 

INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
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Asp lie Gly Ser Glu Ser Thr Glu Asp Gin Gin Xaa Ala Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ala Glu Glu Ser lie Ser Thr Xaa Glu Xaa lie Val Pro 
15 10 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Ala Ala Ala Ala Pro Pro 
15 10 15 

Ala 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gin Gin Thr Ser 
15 10 15 
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Leu Leu Asn Asn Leu Ala Asp Pro Asp Val Ser Phe Ala Asp 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Thr Gly Ser Leu Asn Gin Thr His Asn Arg Arg Ala Asn Glu Arg Lys 

15 10 .15 

Asn Thr Thr Met Lys Met Val Lys Ser He Ala Ala Gly Leu Thr Ala 

20 25 30 

Ala Ala Ala He Gly Ala Ala Ala Ala Gly Val Thr Ser He Met Ala 

35 40 45 

Gly Gly Pro Val Val Tyr Gin Met Gin Pro Val Val Phe Gly Ala Pro 

50 55 60 

Leu Pro Leu Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gin 
65 70 75 80 

Leu Thr Ser Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala 

85 90 95 

Asn Lys Gly Ser Leu Val Glu Gly Gly He Gly Gly Thr Glu Ala Arg 

100 105 HO 

He Ala Asp His Lys Leu Lys Lys Ala Ala Glu His Gly Asp Leu Pro 

115 120 125 

Leu Ser Phe Ser Val Thr Asn He Gin Pro Ala Ala Ala Gly Ser Ala 

130 135 140 

Thr Ala Asp Val Ser Val Ser Gly Pro Lys Leu Ser Ser Pro Val Thr 
145 150 155 16 0 

Gin Asn Val Thr Phe Val Asn Gin Gly Gly Trp Met Leu Ser Arg Ala 

165 170 175 

Ser Ala Met Glu Leu Leu Gin Ala Ala Gly Xaa 
180 185 

(2) INFORMATION FOR SEQ ID NO -.64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Asp Glu Val Thr Val Glu Thr Thr Ser Val Phe Arg Ala Asp Phe Leu 

15 io 15 

Ser Glu Leu Asp Ala Pro Ala Gin Ala Gly Thr Glu Ser Ala Val Ser 

20 25 30 

Gly Val Glu Gly Leu Pro Pro Gly Ser Ala Leu Leu Val Val Lys Arg 

35 40 45 

Gly Pro Asn Ala Gly Ser Arg Phe Leu Leu Asp Gin Ala He Thr Ser 
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Ala Gly Arg His Pro Asp Ser Asp He Phe Leu Asp Asp Val Thr Val 
65 70 75 80 

Ser Arg Arg His Ala Glu Phe Arg Leu Glu Asn Asn Glu Phe Asn Val 

85 90 95 

Val Asp Val Gly Ser Leu Asn Gly Thr Tyr Val Asn Arg Glu Pro Val 

100 105 HO 

Asp Ser Ala Val Leu Ala Asn Gly Asp Glu Val Gin He Gly Lys Leu 

115 120 125 

Arg Leu Val Phe Leu Thr Gly Pro Lys Gin Gly Glu Asp Asp Gly Ser 

130 135 140 

Thr Gly Gly Pro 
145 

INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Thr Ser Asn Arg Pro Ala Arg Arg Gly Arg Arg Ala Pro Arg Asp Thr 

15 10 15 

Gly Pro Asp Arg Ser Ala Ser Leu Ser Leu Val Arg His Arg Arg Gin 

20 25 30 

Gin Arg Asp Ala Leu Cys Leu Ser Ser Thr Gin He Ser Arg Gin Ser 

35 40 45 

Asn Leu Pro Pro Ala Ala Gly Gly Ala Ala Asn Tyr Ser Arg Arg Asn 

50 55 60 

Phe Asp Val Arg He Lys He Phe Met Leu Val Thr Ala Val Val Leu 
65 70 75 80 

Leu Cys Cys Ser Gly Val Ala Thr Ala Ala Pro Lys Thr Tyr Cys Glu 

85 90 95 

Glu Leu Lys Gly Thr Asp Thr Gly Gin Ala Cys Gin He Gin Met Ser 

100 105 HO 

Asp Pro Ala Tyr Asn He Asn He Ser Leu Pro Ser Tyr Tyr Pro Asp 

115 120 125 

Gin Lys Ser Leu Glu Asn Tyr He Ala Gin Thr Arg Asp Lys Phe Leu 

130 135 140 

Ser Ala Ala Thr Ser Ser Thr Pro Arg Glu Ala Pro Tyr Glu Leu Asn 
145 150 155 160 

He Thr Ser Ala Thr Tyr Gin Ser Ala He Pro Pro Arg Gly Thr Gin 

165 170 175 

Ala Val Val Leu Xaa Val Tyr His Asn Ala Gly Gly Thr His Pro Thr 

180 185 190 

Thr Thr Tyr Lys Ala Phe Asp Trp Asp Gin Ala Tyr Arg Lys Pro He 

195 200 205 

Thr Tyr Asp Thr Leu Trp Gin Ala Asp Thr Asp Pro Leu Pro Val Val 

210 215 220 

Phe Pro He Val Ala Arg 
225 230 
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INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Ala Ala Ser Asp Asn Phe Gin Leu Ser Gin Gly Gly Gin Gly Phe 

15 10 15 

Ala He Pro He Gly Gin Ala Met Ala He Ala Gly Gin He Arg Ser 

20 25 30 

Gly Gly Gly Ser Pro Thr Val His He Gly Pro Thr Ala Phe Leu Gly 

35 40 45 

Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gin Arg Val 

50 55 60 

Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr Gly Asp Val 
65 70 75 80 

He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr Ala Met Ala 

85 90 95 

Asp Ala Leu Asn Gly His His Pro Gly Asp Val He Ser Val Asn Trp 

100 105 HO 

Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu 

115 120 125 

Gly Pro Pro Ala 
13 0 

INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Val Pro Leu Arg Ser Pro Ser Met Ser Pro Ser Lys Cys Leu Ala Ala 

15 10 15 

Ala Gin Arg Asn Pro Val He Arg Arg Arg Arg Leu Ser Asn Pro Pro 

20 25 30 

Pro Arg Lys Tyr Arg Ser Met Pro Ser Pro Ala Thr Ala Ser Ala Gly 

35 40 45 

Met Ala Arg Val Arg Arg Arg Ala He Trp Arg Gly Pro Ala Thr Xaa 

50 55 60 

Ser Ala Gly Met Ala Arg Val Arg Arg Trp Xaa Val Met Pro Xaa Val 
65 70 75 80 

He Gin Ser Thr Xaa He Arg Xaa Xaa Gly Pro Phe Asp Asn Arg Gly 
85 90 95 

Ser Glu Arg Lys 
100 

INFORMATION FOR SEQ ID NO: 68: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



Met 


Thr 


Asp Asp 


He 


Leu 


Leu 


He 


Asp Thr Asp Glu Arg Val Arg Thr 


1 






5 








10 15 


Leu 


Thr 


Leu Asn 


Arg 


Pro 


Gin 


Ser 


Arg Asn Ala Leu Ser Ala Ala Leu 
















25 30 


Arg 


Asp 


Arg Phe 


Phe 


Ala 


Xaa 


Leu 


Xaa Asp Ala Glu Xaa Asp Asp Asp 






35 








40 


45 


He 


Asp 


Val Val 


He 


Leu 


Thr 


Gly 


Ala Asp Pro Val Phe Cys Ala Gly 




50 








55 




60 


Leu 


Asp 


Leu Lys 


Val 


Ala 


Gly 


Arg 


Ala Asp Arg Ala Ala Gly His Leu 


65 








70 






75 80 


Thr 


Ala 


Val Gly 


Gly His 


Asp 


Gin 


Ala Gly Asp Arg Arg Asp Gin Arg 








85 








90 95 


Arg 


Arg 


Gly His 


Arg 


Arg 


Ala 


Arg 


Thr Gly Ala Val Leu Arg His Pro 






100 










105 110 


Asp 


Arg 


Leu Arg 


Ala 


Arg 


Pro 


Leu 


Arg Arg His Pro Arg Pro Gly Gly 






115 








120 


125 


Ala 


Ala 


Ala His 


Leu Gly 


Thr 


Gin 


Cys Val Leu Ala Ala Lys Gly Arg 




130 








13 5 




140 


His 


Arg 


Xaa Gly 


Pro 


Val 


Asp 


Glu 


Pro Asp Arg Arg Leu Pro Val Arg 


145 








150 






155 160 


Asp 


Arg 


Arg 













(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



Met 


Lys 


Phe 


Val 


Asn 


His 


He 


Glu 


Pro Val Ala Pro Arg Arg Ala Gly 


1 








5 








10 15 


Gly 


Ala 


Val 


Ala 
20 


Glu 


Val 


Tyr 


Ala 


Glu Ala Arg Arg Glu Phe Gly Arg 
25 30 


Leu 


Pro 


Glu 
35 


Pro 


Leu 


Ala 


Met 


Leu 
40 


Ser Pro Asp Glu Gly Leu Leu Thr 
45 


Ala 


Gly 
50 


Trp 


Ala 


Thr 


Leu 


Arg 
55 


Glu 


Thr Leu Leu Val Gly Gin Val Pro 
60 


Arg 


Gly 


Arg 


Lys 


Glu 


Ala 


Val 


Ala 


Ala Ala Val Ala Ala Ser Leu Arg 


65 










70 






75 80 


Cys 


Pro 


Trp 


Cys 


Val 
85 


Asp 


Ala 


His 


Thr Thr Met Leu Tyr Ala Ala Gly 
90 95 


Gin 


Thr 


Asp 


Thr 
100 


Ala 


Ala 


Ala 


He 


Leu Ala Gly Thr Ala Pro Ala Ala 
105 110 



100 



Gly 


Asp 


Pro 


Asn 


Ala 


Pro 


Tyr 


Val 


Ala Trp Ala Ala Gly Thr Gly Thr 






115 










120 


125 


Pro 


Ala 


Gly 


Pro 


Pro 


Ala 


Pro 


Phe 


Gly Pro Asp Val Ala Ala Glu Tyr 




130 










135 




140 


Leu 


Gly 


Thr 


Ala 


Val 


Gin 


Phe 


His 


Phe He Ala Arg Leu Val Leu Val 


145 










150 






155 160 


Leu 


Leu 


Asp 


Glu 


Thr 


Phe 


Leu 


Pro 


Gly Gly Pro Arg Ala Gin Gin Leu 










165 








170 175 


Met 


Arg 


Arg 


Ala 


Gly 


Gly 


Leu 


Val 


Phe Ala Arg Lys Val Arg Ala Glu 








180 










185 190 


His 


Arg 


Pro 


Gly 


Arg 


Ser 


Thr 


Arg 


Arg Leu Glu Pro Arg Thr Leu Pro 






195 










200 


205 


Asp 


Asp 


Leu 


Ala 


Trp 


Ala 


Thr 


Pro 


Ser Glu Pro He Ala Thr Ala Phe 




210 










215 




220 


Ala 


Ala 


Leu 


Ser 


His 


His 


Leu 


Asp 


Thr Ala Pro His Leu Pro Pro Pro 


225 










230 






235 240 


Thr 


Arg 


Gin 


Val 


Val 


Arg 


Arg 


Val 


Val Gly Ser Trp His Gly Glu Pro 










245 








250 255 


Met 


Pro 


Met 


Ser 


Ser 


Arg 


Trp 


Thr 


Asn Glu His Thr Ala Glu Leu Pro 








260 










265 270 


Ala 


Asp 


Leu 


His 


Ala 


Pro 


Thr 


Arg 


Leu Ala Leu Leu Thr Gly Leu Ala 






275 










280 


285 


Pro 


His 


Gin 


Val 


Thr 


Asp 


Asp 


Asp 


Val Ala Ala Ala Arg Ser Leu Leu 




290 










295 




300 


Asp 


Thr 


Asp 


Ala 


Ala 


Leu 


Val 


Gly 


Ala Leu Ala Trp Ala Ala Phe Thr 


305 










310 






315 320 


Ala 


Ala 


Arg 


Arg 


He 


Gly 


Thr 


Trp 


He Gly Ala Ala Ala Glu Gly Gin 










325 








330 335 


Val 


Ser 


Arg 


Gin 


Asn 


Pro 


Thr 


Gly 










340 













INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 



Asp 


Asp Pro 


Asp 


Met 


Pro 


Gly 


Thr 


Val Ala Lys Ala 


Val 


Ala 


Asp 


Ala 


1 






5 








10 






15 




Leu 


Gly Arg 


Gly 


He 


Ala 


Pro 


Val 


Glu Asp He Gin Asp Cys 


Val 


Glu 






20 










25 




30 






Ala 


Arg Leu 


Gly 


Glu 


Ala 


Gly Leu 


Asp Asp Val Ala 


Arg 


Val 


Tyr 


He 




35 










40 




45 








He 


Tyr Arg 
50 


Gin 


Arg 


Arg 


Ala 
55 


Glu 


Leu Arg Thr Ala 
60 


Lys 


Ala 


Leu 


Leu 


Gly 


Val Arg 


Asp 


Glu 


Leu 


Lys 


Leu 


Ser Leu Ala Ala 


Val 


Thr 


Val 


Leu 


65 








70 






75 








80 


Arg 


Glu Arg 


Tyr 


Leu 
85 


Leu 


His 


Asp 


Glu Gin Gly Arg 
90 


Pro 


Ala 


Glu 
95 


Ser 


Thr 


Gly Glu 


Leu 
100 


Met 


Asp 


Arg 


Ser 


Ala Arg Cys Val 
105 


Ala 


Ala 
X10 


Ala 


Glu 



101 



Asp 


Gin 


Tyr Glu Pro Gly 


Ser Ser Arg Arg 


Trp 


Ala 


Glu 


Arg 


Phe 


Ala 






115 


120 






125 








Thr 


Leu 


Leu Arg Asn Leu 


Glu Phe Leu Pro 


Asn 


Ser 


Pro 


Thr 


Leu 


Met 




130 




135 




140 










Asn 


Ser 


Gly Thr Asp Leu 


Gly Leu Leu Ala 


Gly 


Cys 


Phe 


Val 


Leu 


Pro 


145 




150 




155 










160 


lie 


Glu 


Asp Ser Leu Gin 


Ser He Phe Ala 


Thr 


Leu 


Gly 


Gin 


Ala 


Ala 






165 


170 










175 




Glu 


Leu 


Gin Arg Ala Gly 


Gly Gly Thr Gly 


Tyr 


Ala 


Phe 


Ser 


His 


Leu 






180 


185 








190 






Arg 


Pro 


Ala Gly Asp Arg 


Val Ala Ser Thr 


Gly 


Gly 


Thr 


Ala 


Ser Gly 






195 


200 






205 








Pro 


Val 


Ser Phe Leu Arg 


Leu Tyr Asp Ser 


Ala 


Ala 


Gly 


Val 


Val 


Ser 




210 




215 




220 










Met 


Gly 


Gly Arg Arg Arg 


Gly Ala Cys Met 


Ala 


Val 


Leu 


Asp 


Val 


Ser 


225 




230 




235 










240 


His 


Pro 


Asp He Cys Asp 


Phe Val Thr Ala 


Lys 


Ala 


Glu 


Ser 


Pro 


Ser 






245 


250 










255 




Glu 


Leu 


Pro His Phe Asn 


Leu Ser Val Gly 


Val 


Thr 


Asp 


Ala 


Phe 


Leu 






260 


265 








270 






Arg 


Ala 


Val Glu Arg Asn 


Gly Leu His Arg 


Leu 


Val 


Asn 


Pro 


Arg 


Thr 






275 


280 






285 








Gly 


Lys 


He Val Ala Arg 


Met Pro Ala Ala 


Glu 


Leu 


Phe 


Asp 


Ala 


He 




290 




295 




300 










Cys 


Lys 


Ala Ala His Ala 


Gly Gly Asp Pro 


Gly 


Leu 


Val 


Phe 


Leu Asp 


305 




310 




315 










320 


Thr 


He 


Asn Arg Ala Asn 


Pro Val Pro Gly 


Arg 


Gly 


Arg 


He 


Glu 


Ala 






325 


330 










335 




Thr 


Asn 


Pro Cys Gly Glu 


Val Pro Leu Leu 


Pro 


Tyr 


Glu 


Ser 


Cys 


Asn 






340 


345 








350 






Leu 


Gly 


Ser He Asn Leu 


Ala Arg Met Leu 


Ala 


Asp 


Gly 


Arg 


Val 


Asp 






355 


360 






365 








Trp 


Asp 


Arg Leu Glu Glu 


Val Ala Gly Val 


Ala 


Val 


Arg 


Phe 


Leu Asp 




370 




375 




380 










Asp 


Val 


He Asp Val Ser 


Arg Tyr Pro Phe 


Pro 


Glu 


Leu 


Gly 


Glu 


Ala 


385 




390 




395 










400 


Ala 


Arg 


Ala Thr Arg Lys 


He Gly Leu Gly 


Val 


Met 


Gly 


Leu 


Ala 


Glu 






405 


410 










415 




Leu 


Leu 


Ala Ala Leu Gly 


He Pro Tyr Asp 


Ser 


Glu 


Glu 


Ala 


Val 


Arg 






420 


425 








430 






Leu 


Ala 


Thr Arg Leu Met 


Arg Arg He Gin 


Gin 


Ala 


Ala 


His 


Thr 


Ala 






435 


440 






445 








Ser 


Arg 


Arg Leu Ala Glu 


Glu Arg Gly Ala 


Phe 


Pro 


Ala 


Phe 


Thr 


Asp 




450 




455 




460 










Ser 


Arg 


Phe Ala Arg Ser 


Gly Pro Arg Arg 


Asn 


Ala 


Gin 


Val 


Thr 


Ser 


465 




470 




475 










480 


Val 


Ala 


Pro Thr Gly 

















485 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
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(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Gly Val lie Val Leu Asp Leu Glu Pro Arg Gly Pro Leu Pro Thr Glu 

15 10 15 

lie Tyr Trp Arg Arg Arg Gly Leu Ala Leu Gly lie Ala Val Val Val 

20 25 30 

Val Gly He Ala Val Ala He Val He Ala Phe Val Asp Ser Ser Ala 

35 40 45 

Gly Ala Lys Pro Val Ser Ala Asp Lys Pro Ala Ser Ala Gin Ser His 

50 55 60 

Pro Gly Ser Pro Ala Pro Gin Ala Pro Gin Pro Ala Gly Gin Thr Glu 
65 70 75 80 

Gly Asn Ala Ala Ala Ala Pro Pro Gin Gly Gin Asn Pro Glu Thr Pro 

85 90 95 

Thr Pro Thr Ala Ala Val Gin Pro Pro Pro Val Leu Lys Glu Gly Asp 

100 105 110 

Asp Cys Pro Asp Ser Thr Leu Ala Val Lys Gly Leu Thr Asn Ala Pro 

115 120 125 

Gin Tyr Tyr Val Gly Asp Gin Pro Lys Phe Thr Met Val Val Thr Asn 

130 135 140 

He Gly Leu Val Ser Cys Lys Arg Asp Val Gly Ala Ala Val Leu Ala 
145 * 150 155 160 

Ala Tyr Val Tyr Ser Leu Asp Asn Lys Arg Leu Trp Ser Asn Leu Asp 

165 170 175 

Cys Ala Pro Ser Asn Glu Thr Leu Val Lys Thr Phe Ser Pro Gly Glu 

180 185 190 

Gin Val Thr Thr Ala Val Thr Trp Thr Gly Met Gly Ser Ala Pro Arg 

195 200 205 

Cys Pro Leu Pro Arg Pro Ala He Gly Pro Gly Thr Tyr Asn Leu Val 

210 215 220 

Val Gin Leu Gly Asn Leu Arg Ser Leu Pro Val Pro Phe He Leu Asn 
225 230 235 240 

Gin Pro Pro Pro Pro Pro Gly Pro Val Pro Ala Pro Gly Pro Ala Gin 

245 250 255 

Ala Pro Pro Pro Glu Ser Pro Ala Gin Gly Gly 
260 265 



INFORMATION FOR SEQ ID NO: 72: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Leu He Ser Thr Gly Lys Ala Ser His Ala Ser Leu Gly Val Gin Val 

15 10 15 

Thr Asn Asp Lys Asp Thr Pro Gly Ala Lys lie Val Glu Val Val Ala 

20 25 30 

Gly Gly Ala Ala Ala Asn Ala Gly Val Pro Lys Gly Val Val Val Thr 
35 40 45 
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Lys Val Asp Asp Arg Pro lie Asn Ser Ala Asp Ala Leu Val Ala Ala 

50 55 60 

Val Arg Ser Lys Ala Pro Gly Ala Thr Val Ala Leu Thr Phe Gin Asp 
65 70 75 80 

Pro Ser Gly Gly Ser Arg Thr Val Gin Val Thr Leu Gly Lys Ala Glu 
85 90 _ 95 

Gin 

INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gly Ala Ala Val Ser Leu Leu Ala Ala Gly Thr Leu Val Leu Thr Ala 

15 10 15 

Cys Gly Gly Gly Thr Asn Ser Ser Ser Ser Gly Ala Gly Gly Thr Ser 

20 25 30 

Gly Ser Val His Cys Gly Gly Lys Lys Glu Leu His Ser Ser Gly Ser 

35 40 45 

Thr Ala Gin Glu Asn Ala Met Glu Gin Phe Val Tyr Ala Tyr Val Arg 

50 55 60 

Ser Cys Pro Gly Tyr Thr Leu Asp Tyr Asn Ala Asn Gly Ser Gly Ala 
65 70 75 80 

Gly Val Thr Gin Phe Leu Asn Asn Glu Thr Asp Phe Ala Gly Ser Asp 

85 90 95 

Val Pro Leu Asn Pro Ser Thr Gly Gin Pro Asp Arg Ser Ala Glu Arg 

100 105 110 

Cys Gly Ser Pro Ala Trp Asp Leu Pro Thr Val Phe Gly Pro lie Ala 

115 120 125 

He Thr Tyr Asn He Lys Gly Val Ser Thr Leu Asn Leu Asp Gly Pro 

130 135 140 

Thr Thr Ala Lys He Phe Asn Gly Thr He Thr Val Trp Asn Asp Pro 
145 150 155 160 

Gin He Gin Ala Leu Asn Ser Gly Thr Asp Leu Pro Pro Thr Pro He 

165 170 175 

Ser Val He Phe Arg Ser Asp Lys Ser Gly Thr Ser Asp Asn Phe Gin 

180 185 190 

Lys Tyr Leu Asp Gly Val Ser Asn Gly Ala Trp Gly Lys Gly Ala Ser 

195 200 205 

Glu Thr Phe Ser Gly Gly Val Gly Val Gly Ala Ser Gly Asn Asn Gly 

210 215 220 

Thr Ser Ala Leu Leu Gin Thr Thr Asp Gly Ser He Thr Tyr Asn Glu 
225 230 235 240 

Trp Ser Phe Ala Val Gly Lys Gin Leu Asn Met Ala Gin He He Thr 

245 250 255 

Ser Ala Gly Pro Asp Pro Val Ala He Thr Thr Glu Ser Val Gly Lys 

260 265 270 

Thr He Ala Gly Ala Lys He Met Gly Gin Gly Asn Asp Leu Val Leu 

275 280 285 

Asp Thr Ser Ser Phe Tyr Arg Pro Thr Gin Pro Gly Ser Tyr Pro He 
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290 295 
Val Leu Ala Thr Tyr Glu He Val 
305 310 
Thr Gly Thr Ala Val Arg Ala Phe 
325 

Gin Glu Gly Leu Asp Gin Tyr Gly 
340 

Gin Ala Lys Leu Ala Ala Ala Val 

355 360 



300 

Cys Ser Lys Tyr Pro Asp Ala Thr 
315 320 
Met Gin Ala Ala He Gly Pro Gly 

330 335 
Ser He Pro Leu Pro Lys Ser Phe 
345 " 350 

Asn Ala He Ser 



INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Gin Ala Ala Ala Gly Arg Ala Val Arg Arg Thr Gly His Ala Glu Asp 

15 io 15 

Gin Thr His Gin Asp Arg Leu His His Gly Cys Arg Arg Ala Ala Val 

20 25 30 

Val Val Arg Gin Asp Arg Ala Ser Val Ser Ala Thr Ser Ala Arg Pro 

35 40 45 

Pro Arg Arg His Pro Ala Gin Gly His Arg Arg Arg Val Ala Pro Ser 

50 55 60 

Gly Gly Arg Arg Arg Pro His Pro His His Val Gin Pro Asp Asp Arg 
65 * 70 75 80 

Arg Asp Arg Pro Ala Leu Leu Asp Arg Thr Gin Pro Ala Glu His Pro 

85 90 95 

Asp Pro His Arg Arg Gly Pro Ala Asp Pro Gly Arg Val Arg Gly Arg 

100 105 HO 

Gly Arg Leu Arg Arg Val Asp Asp Gly Arg Leu Gin Pro Asp Arg Asp 

115 120 125 

Ala Asp His Gly Ala Pro Val Arg Gly Arg Gly Pro His Arg Gly Val 

130 135 140 

Gin His Arg Gly Gly Pro Val Phe Val Arg Arg Val Pro Gly Val Arg 
145 150 155 160 

Cys Ala His Arg Arg Gly His Arg Arg Val Ala Ala Pro Gly Gin Gly 

165 170 175 

Asp Val Leu Arg Ala Gly Leu Arg Val Glu Arg Leu Arg Pro Val Ala 

180 185 190 

Ala Val Glu Asn Leu His Arg Gly Ser Gin Arg Ala Asp Gly Arg Val 

195 200 205 

Phe Arg Pro He Arg Arg Gly Ala Arg Leu Pro Ala Arg Arg Ser Arg 

210 215 220 

Ala Gly Pro Gin Gly Arg Leu His Leu Asp Gly Ala Gly Pro Ser Pro 
225 230 235 240 

Leu Pro Ala Arg Ala Gly Gin Gin Gin Pro Ser Ser Ala Gly Gly Arg 

245 250 255 

Arg Ala Gly Gly Ala Glu Arg Ala Asp Pro Gly Gin Arg Gly Arg His 

260 265 270 

His Gin Gly Gly His Asp Pro Gly Arg Gin Gly Ala Gin Arg Gly Thr 
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275 280 285 

Ala Gly Val Ala His Ala Ala Ala Gly Pro Arg Arg Ala Ala Val Arg 

290 295 300 

Asn Arg Pro Arg Arg 
305 



INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



Ser Ala Val Trp Cys Leu Asn Gly Phe Thr Gly Arg His Arg His Gly 

15 10 15 

Arg Cys Arg Val Arg Ala Ser Gly Trp Arg Ser Ser Asn Arg Trp Cys 

20 25 30 

Ser Thr Thr Ala Asp Cys Cys Ala Ser Lys Thr Pro Thr Gin Ala Ala 

35 40 45 

Ser Pro Leu Glu Arg Arg Phe Thr Cys Cys Ser Pro Ala Val Gly Cys 

50 55 60 

Arg Phe Arg Ser Phe Pro Val Arg Arg Leu Ala Leu Gly Ala Arg Thr 
65 70 75 80 

Ser Arg Thr Leu Gly Val Arg Arg Thr Leu Ser Gin Trp Asn Leu Ser 

85 90 95 

Pro Arg Ala Gin Pro Ser Cys Ala Val Thr Val Glu Ser His Thr His 

100 105 110 

Ala Ser Pro Arg Met Ala Lys Leu Ala Arg Val Val Gly Leu Val Gin 

115 120 125 

Glu Glu Gin Pro Ser Asp Met Thr Asn His Pro Arg Tyr Ser Pro Pro 

130 135 140 

Pro Gin Gin Pro Gly Thr Pro Gly Tyr Ala Gin Gly Gin Gin Gin Thr 
145 150 155 160 

Tyr Ser Gin Gin Phe Asp Trp Arg Tyr Pro Pro Ser Pro Pro Pro Gin 

165 170 175 

Pro Thr Gin Tyr Arg Gin Pro Tyr Glu Ala Leu Gly Gly Thr Arg Pro 

180 185 190 

Gly Leu lie Pro Gly Val lie Pro Thr Met Thr Pro Pro Pro Gly Met 

195 200 205 

Val Arg Gin Arg Pro Arg Ala Gly Met Leu Ala lie Gly Ala Val Thr 

210 215 220 

lie Ala Val Val Ser Ala Gly He Gly Gly Ala Ala Ala Ser Leu Val 
225 230 235 240 

Gly Phe Asn Arg Ala Pro Ala Gly Pro Ser Gly Gly Pro Val Ala Ala 

245 250 255 

Ser Ala Ala Pro Ser He Pro Ala Ala Asn Met Pro Pro Gly Ser Val 

260 265 270 

Glu Gin Val Ala Ala Lys Val Val Pro Ser Val Val Met Leu Glu Thr 

275 280 285 

Asp Leu Gly Arg Gin Ser Glu Glu Gly Ser Gly He He Leu Ser Ala 

290 295 300 

Glu Gly Leu He Leu Thr Asn Asn His Val He Ala Ala Ala Ala Lys 
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305 310 315 320 





Pro 


eu 






ro 


ro 


Pro Lys Thr 


Thr Val Thr Phe Ser Asp 










325 












335 


y 


Arg 


Thr 
r 


Ala 


Pro 
ro 


Phe 


Thr 


Val Val Gly 


Ala 


Asp 


Pro Thr Ser Asp 








340 














350 


e 


a 


a 


Val 


Arg 


Va 


G n 


ij±y vai oer 


Gly Leu .Thr Pro lie Ser 
















360 






365 


Leu 




Ser 


Ser 


Ser 


Asp 




Arg Val Gly 


Gin 


Pro 


Val Leu Ala He 




3 70 










375 






380 






er 


ro 


Leu 


G y 




U 


vj-Ly iul Vai 


Thr 


Thr 


Gly He Val Ser 


385 
















395 




400 


3 


Leu 


Asn 


Arg 


Pro 


Val 


Ser 


Thr Thr Gly 


Glu 


Ala 


Gly Asn Gin Asn 






















415 


T r 


Va 


Leu 


Asp 


Ala 


I e 


G n 


Thr Asp Ala 


Ala 


He 


Asn Pro Gly Asn 
















425 






430 


Ser 


Gly 


Gly 


Ala 


Leu 


Val 


Asn 


Met Asn Ala 


Gin 


Leu 


Val Gly Val Asn 






435 










440 






445 


Ser 


Ala 


lie 


Ala 


Thr 


Leu 


Gly 


Ala Asp Ser 


Ala 


Asp 


Ala Gin Ser Gly 




450 










455 






460 




Ser 


lie 


Gly 


Leu 


Gly 


Phe 


Ala 


lie Pro Val 


Asp 


Gin 


Ala Lys Arg He 


465 










470 






475 




480 


Ala 


Asp 


Glu 


Leu 


lie 


Ser 


Thr 


Gly Lys Ala 


Ser 


His 


Ala Ser Leu Gly 










485 






490 






495 


Val 


Gin 


Val 


Thr 


Asn 


Asp 


Lys 


Asp Thr Pro 


Gly Ala 


Lys He Val Glu 






















510 


Val 


Val 


Ala 


Gly 


Gly 


Ala 


Ala 


Ala Asn Ala 


Gly Val 


Pro Lys Gly Val 






515 










520 






525 


Val 


Val 


Thr 


Lys 


Val 


Asp 


Asp 


Arg Pro lie 


Asn 


Ser 


Ala Asp Ala Leu 




530 










535 






540 




Val 


Ala 


Ala 


Val 


Arg 


Ser 


Lys 


Ala Pro Gly 


Ala 


Thr 


Val Ala Leu Thr 


545 










550 






555 




560 


Phe 


Gin 


Asp 


Pro 


Ser 


Gly 


Gly 


Ser Arg Thr 


Val 


Gin 


Val Thr Leu Gly 










565 






570 






575 


Lys 


Ala 


Glu 


Gin 

















580 

INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Met Asn Asp Gly Lys Arg Ala Val Thr Ser Ala Val 

15 10 

Gly Ala Cys Leu Ala Leu Trp Leu Ser Gly Cys Ser 

20 25 
Asp Ala Glu Glu Gin Gly Val Pro Val Ser Pro Thr 

35 40 
Ala Leu Leu Ala Glu He Arg Gin Ser Leu Asp Ala 

50 55 60 

Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys 



Leu Val Val Leu 
15 

Ser Pro Lys Pro 
30 

Ala Ser Asp Pro 
45 

Thr Lys Gly Leu 
Val Asp Ser Leu 
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65 










70 




75 


80 


Leu 


Gly 


lie 


Thr 


Ser 


Ala 


Asp Val 


Asp Val Arg 


Ala Asn Pro Leu Ala 










85 






90 




Ala 


Lys 


Gly 


Val 


Cys Thr 


Tyr Asn 


Asp Glu Gin 


Gly Val Pro Phe Arg 








100 








105 


110 


Val 


Gin 


Gly 


Asp 


Asn 


He 


Ser Val 


Lys Leu Phe 


Asp .Asp Trp Ser Asn 






115 








120 




125 


Leu 


Gly 


Ser 


lie 


Ser 


Glu 


Leu Ser 


Thr Ser Arg 


Val Leu Asp Pro Ala 




130 










135 




140 


Ala 


Gly 


Val 


Thr 


Gin 


Leu 


Leu Ser 


Gly Val Thr 


Asn Leu Gin Ala Gin 


145 










150 




155 




dy- 


Thr 


Glu 


Val 


He 


Asp 


Gly He 


Ser Thr Thr 


Lys He Thr Gly Thr 










165 










He 


Pro 


Ala 


Ser 


Ser 


Val 


Lys Met 


Leu Asp Pro 


Gly Ala Lys Ser Ala 








180 








185 


190 


Arg 


Pro 


Ala 


Thr 


Val 


Trp 


He Ala 


Gin Asp Gly 


Ser His His Leu Val 






195 








200 




205 


Arg 


Ala 


Ser 


He 


Asp 


Leu 


Gly Ser 


Gly Ser He 


Gin Leu Thr Gin Ser 




210 










215 




220 


Lys 


Trp 


Asn 


Glu 


Pro 


Val 


Asn Val 


Asp 




225 










230 









INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Val He Asp He He Gly Thr Ser Pro Thr Ser Trp Glu Gin Ala Ala 

15 10 15 

Ala Glu Ala Val Gin Arg Ala Arg Asp Ser Val Asp Asp He Arg Val 

20 25 30 

Ala Arg Val He Glu Gin Asp Met Ala Val Asp Ser Ala Gly Lys He 

35 40 45 

Thr Tyr Arg He Lys Leu Glu Val Ser Phe Lys Met Arg Pro Ala Gin 

50 55 60 

Pro Arg 
65 

INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Val Pro Pro Ala Pro Pro Leu Pro Pro Leu Pro Pro Ser Pro He Ser 
15 10 15 



108 



Cys Ala Ser Pro Pro Ser Pro Pro Leu Pro Pro Ala Pro Pro Val Ala 

20 25 30 

Pro Gly Pro Pro Met Pro Pro Leu Asp Pro Trp Pro Pro Ala Pro Pro 

35 40 45 

Leu Pro Tyr Ser Thr Pro Pro Gly Ala Pro Leu Pro Pro Ser Pro Pro 

50 55 60 

Ser Pro Pro Leu Pro 
65 



INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Met Ser Asn Ser Arg Arg Arg Ser Leu Arg Trp Ser Trp Leu Leu Ser 

15 10 15 

Val Leu Ala Ala Val Gly Leu Gly Leu Ala Thr Ala Pro Ala Gin Ala 

20 25 30 

Ala Pro Pro Ala Leu Ser Gin Asp Arg Phe Ala Asp Phe Pro Ala Leu 

35 40 45 

Pro Leu Asp Pro Ser Ala Met Val Ala Gin Val Ala Pro Gin Val Val 

50 55 60 

Asn lie Asn Thr Lys Leu Gly Tyr Asn Asn Ala Val Gly Ala Gly Thr 
65 70 75 80 

Gly lie Val lie Asp Pro Asn Gly Val Val Leu Thr Asn Asn His Val 

85 90 95 

He Ala Gly Ala Thr Asp He Asn Ala Phe Ser Val Gly Ser Gly Gin 

100 105 110 

Thr Tyr Gly Val Asp Val Val Gly Tyr Asp Arg Thr Gin Asp Val Ala 

115 120 125 

Val Leu Gin Leu Arg Gly Ala Gly Gly Leu Pro Ser Ala Ala He Gly 

130 135 140 

Gly Gly Val Ala Val Gly Glu Pro Val Val Ala Met Gly Asn Ser Gly 
145 150 155 160 

Gly Gin Gly Gly Thr Pro Arg Ala Val Pro Gly Arg Val Val Ala Leu 

165 170 175 

Gly Gin Thr Val Gin Ala Ser Asp Ser Leu Thr Gly Ala Glu Glu Thr 

180 185 190 

Leu Asn Gly Leu He Gin Phe Asp Ala Ala He Gin Pro Gly Asp Ser 

195 200 205 

Gly Gly Pro Val Val Asn Gly Leu Gly Gin Val Val Gly Met Asn Thr 

210 215 220 

Ala Ala Ser Asp Asn Phe Gin Leu Ser Gin Gly Gly Gin Gly Phe Ala 
225 230 235 240 

He Pro He Gly Gin Ala Met Ala He Ala Gly Gin He Arg Ser Gly 

245 250 255 

Gly Gly Ser Pro Thr Val His He Gly Pro Thr Ala Phe Leu Gly Leu 

260 265 270 

Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gin Arg Val Val 
275 280 285 
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Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr Gly Asp Val He 

290 295 300 

Thr Ala Val Asp Gly Ala Pro lie Asn Ser Ala Thr Ala Met Ala Asp 
305 310 315 320 

Ala Leu Asn Gly His His Pro Gly Asp Val He Ser Val Asn Trp Gin 

325 330 - 335 

Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu Gly 
340 345 350 

Pro Pro Ala 
355 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ser Pro Lys Pro Asp Ala Glu Glu Gin Gly Val Pro Val Ser Pro Thr 

15 10 15 

Ala Ser Asp Pro Ala Leu Leu Ala Glu He Arg Gin Ser Leu Asp Ala 

20 25 30 

Thr Lys Gly Leu Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys 

35 40 45 

Val Asp Ser Leu Leu Gly He Thr Ser Ala Asp Val Asp Val Arg Ala 

50 55 60 

Asn Pro Leu Ala Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gin Gly 
65 70 75 80 

Val Pro Phe Arg Val Gin Gly Asp Asn He Ser Val Lys Leu Phe Asp 

85 90 95 

Asp Trp Ser Asn Leu Gly Ser He Ser Glu Leu Ser Thr Ser Arg Val 

100 105 HO 

Leu Asp Pro Ala Ala Gly Val Thr Gin Leu Leu Ser Gly Val Thr Asn 

115 120 125 

Leu Gin Ala Gin Gly Thr Glu Val He Asp Gly He Ser Thr Thr Lys 

130 135 140 

He Thr Gly Thr He Pro Ala Ser Ser Val Lys Met Leu Asp Pro Gly 
145 150 155 160 

Ala Lys Ser Ala Arg Pro Ala Thr Val Trp He Ala Gin Asp Gly Ser 

165 170 175 

His His Leu Val Arg Ala Ser He Asp Leu Gly Ser Gly Ser He Gin 

180 185 190 

Leu Thr Gin Ser Lys Trp Asn Glu Pro Val Asn Val Asp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Gly Asp Ser Phe Trp Ala Ala Ala Asp Gin Met Ala Arg Gly Phe Val 

15 10 15 

Leu Gly Ala Thr Ala Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gin 

20 25 " 30 

His Ala Asp Gly His Ser Leu Leu Leu Asp Ala Thr Asn Pro Ala Val 

35 40 45 

Val Ala Tyr Asp Pro Ala Phe Ala Tyr Glu He Gly Tyr He Xaa Glu 

50 55 60 

Ser Gly Leu Ala Arg Met Cys Gly Glu Asn Pro Glu Asn He Phe Phe 
65 70 75 80 

Tyr He Thr Val Tyr Asn Glu Pro Tyr Val Gin Pro Pro Glu Pro Glu 

85 90 95 

Asn Phe Asp Pro Glu Gly Val Leu Gly Gly He Tyr Arg Tyr His Ala 

100 105 110 

Ala Thr Glu Gin Arg Thr Asn Lys Xaa Gin He Leu Ala Ser Gly Val 

115 120 125 

Ala Met Pro Ala Ala Leu Arg Ala Ala Gin Met Leu Ala Ala Glu Trp 

130 135 140 

Asp Val Ala Ala Asp Val Trp Ser Val Thr Ser Trp Gly Glu Leu Asn 
145 150 155 160 

Arg Asp Gly Val Val He Glu Thr Glu Lys Leu Arg His Pro Asp Arg 

165 170 175 

Pro Ala Gly Val Pro Tyr Val Thr Arg Ala Leu Glu Asn Ala Arg Gly 

180 185 190 

Pro Val He Ala Val Ser Asp Trp Met Arg Ala Val Pro Glu Gin He 

195 200 205 

Arg Pro Trp Val Pro Gly Thr Tyr Leu Thr Leu Gly Thr Asp Gly Phe 

210 215 220 

Gly Phe Ser Asp Thr Arg Pro Ala Gly Arg Arg Tyr Phe Asn Thr Asp 
225 ~ 230 235 240 

Ala Glu Ser Gin Val Gly Arg Gly Phe Gly Arg Gly Trp Pro Gly Arg 

245 250 255 

Arg Val Asn He Asp Pro Phe Gly Ala Gly Arg Gly Pro Pro Ala Gin 

260 265 270 

Leu Pro Gly Phe Asp Glu Gly Gly Gly Leu Arg Pro Xaa Lys 
275 280 285 



INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Thr Lys Phe His Ala Leu Met Gin Glu Gin He His Asn Glu Phe Thr 

15 10 15 

Ala Ala Gin Gin Tyr Val Ala He Ala Val Tyr Phe Asp Ser Glu Asp 

20 25 30 

Leu Pro Gin Leu Ala Lys His Phe Tyr Ser Gin Ala Val Glu Glu Arg 



Ill 



Asn 


His 


Ala 


Met 


Met 


Leu 


Val 


Gin His Leu 


Leu Asp Arg Asp Leu Arg 




50 










55 




60 


Val 


Glu 


He 


Pro 


Gly 


Val 


Asp 


Thr Val Arg 


Asn Gin Phe Asp Arg Pro 


65 










70 






75 80 


Arg 


Glu 


Ala 


Leu 


Ala 


Leu 


Ala 


Leu Asp Gin 


Glu Arg Thr Val Thr Asp 










85 






90 


95 


Gin 


Val 


Gly 


Arg 


Leu 


Thr 


Ala 


Val Ala Arg 


Asp Glu Gly Asp Phe Leu 








100 








105 


110 


Gly 


Glu 


Gin 


Phe 


Met 


Gin 


Trp 


Phe Leu Gin 


Glu Gin He Glu Glu Val 






115 










120 


125 


Ala 


Leu 


Met 


Ala 


Thr 


Leu 


Val 


Arg Val Ala 


Asp Arg Ala Gly Ala Asn 




130 










135 




140 


Leu 


Phe 


Glu 


Leu 


Glu 


Asn 


Phe 


Val Ala Arg 


Glu Val Asp Val Ala Pro 


145 










150 






155 160 


Ala 


Ala 


Ser 


Gly 


Ala 


Pro 


His 


Ala Ala Gly 


Gly Arg Leu 










165 






170 





(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 





Arg 


Ala 


Asp 


Glu 


Arg 


Lys 


Asn 


Thr 


Thr Met 


Lys Met Val Lys Ser He 




1 








5 








10 


15 




Ala 


Ala 


Gly 


Leu 


Thr 


Ala 


Ala 


Ala 


Ala He 


Gly Ala Ala Ala Ala Gly 










20 










25 


30 




Val 


Thr 


Ser 


lie 


Met 


Ala 


Gly 


Gly 


Pro Val 


Val Tyr Gin Met Gin Pro 








35 










40 




45 




Val 


Val 


Phe 


Gly 


Ala 


Pro 


Leu 


Pro 


Leu Asp 


Pro Xaa Ser Ala Pro Xaa 






50 










55 






60 




Val 


Pro 


Thr 


Ala 


Ala 


Gin 


Trp 


Thr 


Xaa Leu 


Leu Asn Xaa Leu Xaa Asp 




65 










70 








75 80 




Pro 


Asn 


Val 


Ser 


Phe 


Xaa 


Asn 


Lys 


Gly Ser 


Leu Val Glu Gly Gly He 












85 








90 


95 




Gly 


Gly 


Xaa 


Glu 


Gly 


Xaa 


Xaa 


Arg 


Arg Xaa 


Gin 



100 105 

(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Val Leu Ser Val Pro Val Gly Asp Gly Phe Trp Xaa Arg Val Val Asn 

15 10 15 

Pro Leu Gly Gin Pro He Asp Gly Arg Gly Asp Val Asp Ser Asp Thr 
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20 25 30 



Arg 


Arg 


Ala 




Glu 




Gin 


Ala 




Ser Val Val Xaa Arg Gin Gly 






35 










40 




45 


Val 


Lys 


Glu 


Pro 


Leu 


Xaa 


Thr 


Gly 


He 


Lys Ala He Asp Ala Met Thr 




50 










55 






60 


Pro 


He 


Gly 


Arg 


Gly 


Gin 


Arg 


Gin 


Leu 


He He Gly Asp Arg Lys Thr 


65 










70 








75 " 80 


Gly 


Lys 


Asn 


Arg 


Arg 


Leu 


Cys 


Arg 


Thr 


Pro Ser Ser Asn Gin Arg Glu 










85 










90 95 


Glu 


Leu 


Gly 


Val 


Arg 


Trp 


He 


Pro 


Arg 


Ser Arg Cys Ala Cys Val Tyr 








100 










105 


110 


Val 


Gly 


His 


Arg 


Ala 


Arg 


Arg 


Gly 


Thr 


Tyr His Arg Arg 






115 










120 




125 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 



Cys 


Asp 


Ala 


Val 


Met 


Gly 


Phe 


Leu 


Gly Gly Ala Gly Pro Leu Ala Val 


1 








5 








10 15 


Val 


Asp 


Gin 


Gin 


Leu 


Val 


Thr 


Arg 


Val Pro Gin Gly Trp Ser Phe Ala 








20 










25 30 


Gin 


Ala 


Ala 


Ala 


Val 


Pro 


Val 


Val 


Phe Leu Thr Ala Trp Tyr Gly Leu 






35 










40 


45 


Ala 


Asp 


Leu 


Ala 


Glu 


He 


Lys 


Ala 


Gly Glu Ser Val Leu He His Ala 




50 










55 




60 


Gly 


Thr 


Gly 


Gly 


Val 


Gly 


Met 


Ala 


Ala Val Gin Leu Ala Arg Gin Trp 


65 










70 






75 80 


Gly 


Val 


Glu 


Val 


Phe 


Val 


Thr 


Ala 


Ser Arg Gly Lys Trp Asp Thr Leu 










85 








90 95 


Arg 


Ala 


Xaa 


Xaa 


Phe 


Asp 


Asp 


Xaa 


Pro Tyr Arg Xaa Phe Pro His Xaa 








100 










105 110 


Arg 


Ser 


Ser 


Xaa 


Gly 











115 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Tyr Arg Phe Ala Cys Arg Thr Leu Met Leu Ala Ala Cys He Leu 
15 10 15 

Ala Thr Gly Val Ala Gly Leu Gly Val Gly Ala Gin Ser Ala Ala Gin 
20 25 30 



113 



Thr Ala Pro Val Pro Asp Tyr Tyr 

35 40 
Pro Ala Trp Gly Pro Asn Trp Asp 

50 55 
His Arg Asp Ser Asp Gly Pro Asp 
65 70 
lie Leu Glu Gly Pro Val Leu Asp 
85 

Pro Ala Ala Gly Gly Gly Ala 

100 

INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: SI 


:q ID NO 


87: 






Val 


Gin Cys Arg Val Trp Leu 


Glu 


He Gin Trp Arg Gly Met 


Leu Gly 


1 


5 






10 




15 


Ala 


Asp Gin Ala Arg Ala Gly 


Gly 


Pro 


Ala Arg He 


Trp Arg 


Glu His 




20 




25 




30 




Ser 


Met Ala Ala Met Lys Pro 


Arg 


Thr 


Gly Asp Gly 


Pro Leu 


Glu Ala 




35 


40 






45 




Thr 


Lys Glu Gly Arg Gly He 


Val 


Met 


Arg Val Pro 


Leu Glu Gly Gly 




50 55 






60 






Gly 


Arg Leu Val Val Glu Leu 


Thr 


Pro 


Asp Glu Ala 


Ala Ala 


Leu Gly 


65 


70 






75 




80 


Asp 


Glu Leu Lys Gly Val Thr 


Ser 











85 



INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly Asn Phe Glu Arg He 

15 10 15 

Ser Gly Asp Leu Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala Gly 

20 25 30 

Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin Ala 

35 40 45 

Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu Leu 

50 55 60 

Asp Glu He Ser Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg 
65 70 75 80 

Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 



Trp Cys Pro Gly Gin Pro Phe Asp 
45 

Pro Tyr Thr Cys His Asp Asp Phe 
60 

His Ser Arg Asp Tyr Pro Gly Pro 

75 80 
Asp Pro Gly Ala Ala Pro Pro Pro 
90 95 
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85 90 95 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



Met 


Thr 


Gin 


Ser 


Gin 


Thr 


Val 


Thr Val Asp Gin Gin Glu lie Leu Asn 


1 








5 






10 15 


Arg 


Ala 


Asn 


Glu 


Val 


Glu 


Ala 


Pro Met Ala Asp Pro Pro Thr . Asp Val 








20 








25 30 


Pro 


lie 


Thr 


Pro 


Cys 


Glu 


Leu 


Thr Xaa Xaa Lys Asn Ala Ala Gin Gin 






35 










40 45 


Xaa 


Val 


Leu 


Ser 


Ala 


Asp 


Asn 


Met Arg Glu Tyr Leu Ala Ala Gly Ala 




50 










55 


60 


Lys 


Glu 


Arg 


Gin 


Arg 


Leu 


Ala 


Thr Ser Leu Arg Asn Ala Ala Lys Xaa 


65 










70 




75 80 


Tyr 


Gly 


Glu 


Val 


Asp 


Glu 


Glu 


Ala Ala Thr Ala Leu Asp Asn Asp Gly 










85 






90 95 


Glu 


Gly 


Thr 


Val 


Gin 


Ala 


Glu 


Ser Ala Gly Ala Val Gly Gly Asp Ser 








100 








105 110 


Ser 


Ala 


Glu 


Leu 


Thr 


Asp 


Thr 


Pro Arg Val Ala Thr Ala Gly Glu Pro 






115 










120 125 


Asn 


Phe 


Met 


Asp 


Leu 


Lys 


Glu 


Ala Ala Arg Lys Leu Glu Thr Gly Asp 




130 










135 


140 


Gin 


Gly 


Ala 


Ser 


Leu 


Ala 


His 


Xaa Gly Asp Gly Trp Asn Thr Xaa Thr 


145 










150 




155 160 


Leu 


Thr 


Leu 


Gin 


Gly 


Asp 
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(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Arg Ala Glu Arg Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Val Ala Trp Met Ser Val Thr Ala Gly Gin Ala Glu Leu Thr Ala Ala 

1^5 10 15 

Gin Val Arg Val Ala Ala Ala Ala Tyr Glu Thr Ala Tyr Gly Leu Thr 

20 25 30 

Val Pro Pro Pro Val He Ala Glu Asn Arg Ala Glu Leu Met He Leu 

35 40 45 

He Ala Thr Asn Leu Leu Gly Gin Asn Thr Pro Ala He Ala Val Asn 

50 55 60 

Glu Ala Glu Tyr Gly Glu Met Trp Ala Gin Asp Ala Ala Ala Met Phe 
65 " 70 75 80 

Gly Tyr Ala Ala Ala Thr Ala Thr Ala Thr Ala Thr Leu Leu Pro Phe 

85 90 ?5 

Glu Glu Ala Pro Glu Met Thr Ser Ala Gly Gly Leu Leu Glu Gin Ala 

100 105 HO 

Ala Ala Val Glu Glu Ala Ser Asp Thr Ala Ala Ala Asn Gin Leu Met 

115 120 125 

Asn Asn Val Pro Gin Ala Leu Lys Gin Leu Ala Gin Pro Thr Gin Gly 

130 135 140 

Thr Thr Pro Ser Ser Lys Leu Gly Gly Leu Trp Lys Thr Val Ser Pro 
145 150 155 160 

His Arg Ser Pro He Ser Asn Met Val Ser Met Ala Asn Asn His Met 

165 170 175 

Ser Met Thr Asn Ser Gly Val Ser Met Thr Asn Thr Leu Ser Ser Met 

180 185 190 

Leu Lys Gly Phe Ala Pro Ala Ala Ala Ala Gin Ala Val Gin Thr Ala 

195 200 205 

Ala Gin Asn Gly Val Arg Ala Met Ser Ser Leu Gly Ser Ser Leu Gly 

210 215 220 

Ser Ser Gly Leu Gly Gly Gly Val Ala Ala Asn Leu Gly Arg Ala Ala 
225 * 230 235 240 

Ser Val Arg Tyr Gly His Arg Asp Gly Gly Lys Tyr Ala Xaa Ser Gly 

245 250 255 

Arg Arg Asn Gly Gly Pro Ala 
260 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Met Thr Tyr Ser Pro Gly Asn Pro Gly Tyr Pro Gin Ala Gin Pro Ala 

15 10 15 

Gly Ser Tyr Gly Gly Val Thr Pro Ser Phe Ala His Ala Asp Glu Gly 

20 ' 25 30 

Ala Ser Lys Leu Pro Met Tyr Leu Asn He Ala Val Ala Val Leu Gly 

35 40 45 

Leu Ala Ala Tyr Phe Ala Ser Phe Gly Pro Met Phe Thr Leu Ser Thr 
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Glu Leu Gly Gly Gly Asp Gly Ala Val Ser Gly Asp Thr Gly Leu Pro 
65 ' ^ 70 75 80 

Val Gly Val Ala Leu Leu Ala Ala Leu Leu Ala Gly Val Val Leu Val 

85 90 95 

Pro Lys Ala Lys Ser His Val Thr Val Val Ala Val Leu Gly Val Leu 

100 105 " HO 

Gly Val Phe Leu Met Val Ser Ala Thr Phe Asn Lys Pro Ser Ala Tyr 

115 120 125 

Ser Thr Gly Trp Ala Leu Trp Val Val Leu Ala Phe He Val Phe Gin 

130 135 140 

Ala Val Ala Ala Val Leu Ala Leu Leu Val Glu Thr Gly Ala He Thr 
145 150 155 160 

Ala Pro Ala Pro Arg Pro Lys Phe Asp Pro Tyr Gly Gin Tyr Gly Arg 

165 170 175 

Tyr Gly Gin Tyr Gly Gin Tyr Gly Val Gin Pro Gly Gly Tyr Tyr Gly 

180 185 190 

Gin Gin Gly Ala Gin Gin Ala Ala Gly Leu Gin Ser Pro Gly Pro Gin 

195 200 205 

Gin Ser Pro Gin Pro Pro Gly Tyr Gly Ser Gin Tyr Gly Gly Tyr Ser 

210 215 220 

Ser Ser Pro Ser Gin Ser Gly Ser Gly Tyr Thr Ala Gin Pro Pro Ala 
225 230 235 240 

Gin Pro Pro Ala Gin Ser Gly Ser Gin Gin Ser His Gin Gly Pro Ser 

245 250 255 

Thr Pro Pro Thr Gly Phe Pro Ser Phe Ser Pro Pro Pro Pro Val Ser 

260 265 270 

Ala Gly Thr Gly Ser Gin Ala Gly Ser Ala Pro Val Asn Tyr Ser Asn 

275 280 285 

Pro Ser Gly Gly Glu Gin Ser Ser Ser Pro Gly Gly Ala Pro Val 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Gly Cys Gly Glu Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly Asn 
15 10 15 

Phe Glu Arg He Ser Gly Asp Leu Lys Thr Gin He 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Asp Gin Val Glu Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Gly Cys Gly Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala 
! 5 10 15 

Ala Gly Thr Ala Ala Gin Ala Ala Val Val Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Gly Cys Gly Gly Thr Ala Ala Gin Ala Ala Val Val Arg Phe Gin Glu 
15 10 15 

Ala Ala Asn Lys Gin Lys Gin Glu Leu Asp Glu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Gly Cys Gly Ala Asn Lys Gin Lys Gin Glu Leu Asp Glu lie Ser Thr 
15 10 15 

Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Gly Cys Gly lie Arg Gin Ala Gly Val Gin Tyr Ser Arg Ala Asp Glu 
15 10 15 

Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 99: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



ATGAAGATGG TGAAATCGAT CGCCGCAGGT CTGACCGCCG CGGCTGCAAT CGGCGCCGCT 60 

GCGGCCGGTG TGACTTC GAT CATGGCTGGC GGCCCGGTCG TATACCAGAT GCAGCCGGTC 120 

GTCTTCGGCG CGCCACTGCC GTTGGACCCG GCATCCGCCC CTGACGTCCC GACCGCCGCC 180 

CAGTTGACCA GCCTGCTCAA CAGCCTCGCC GATCCCAACG TGTCGTTTGC GAACAAGGGC 240 

AGTCTGGTCG AGGGCGGCAT CGGGGGCACC GAGGCGCGCA TCGCCGACCA CAAGCTGAAG 300 

AAGGCCGCCG AGCACGGGGA TCTGCCGCTG TCGTTCAGCG TGACGAACAT CCAGCCGGCG 360 

GCCGCCGGTT CGGCCACCGC CGACGTTTCC GTCTCGGGTC CGAAGCTCTC GTCGCCGGTC 42 0 

ACGCAGAACG TCACGTTCGT GAATCAAGGC GGCTGGATGC TGTCACGCGC ATCGGCGATG 48 0 

GAGTTGCTGC AGGCCGCAGG GAACTGA 507 



(2) INFORMATION FOR SEQ ID NO: 100: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 100: 

Met Lys Met Val Lys Ser He Ala Ala Gly Leu Thr Ala Ala Ala Ala 

15 10 15 

He Gly Ala Ala Ala Ala Gly Val Thr Ser He Met Ala Gly Gly Pro 

20 25 30 

Val Val Tyr Gin Met Gin Pro Val Val Phe Gly Ala Pro Leu Pro Leu 

35 40 45 

Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gin Leu Thr Ser 

50 55 60 

Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala Asn Lys Gly 
65 70 75 80 

Ser Leu Val Glu Gly Gly He Gly Gly Thr Glu Ala Arg He Ala Asp 

85 90 95 

His Lys Leu Lys Lys Ala Ala Glu His Gly Asp Leu Pro Leu Ser Phe 
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100 

Ser Val Thr Asn He Gin Pro Ala 
115 120 
Val Ser Val Ser Gly Pro Lys Leu 

130 135 
Thr Phe Val Asn Gin Gly Gly Trp 
145 150 
Glu Leu Leu Gin Ala Ala Gly Asn 
165 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CGTGGCAATG TCGTTGACCG TCGGGGCCGG GGTCGCCTCC GCAGATCCCG TGGACGCGGT 60 

CATTAACACC ACCTGCAATT ACGGGCAGGT AGTAGCTGCG CTCAACGCGA CGGATCCGGG 120 

GGCTGCCGCA CAGTTCAACG CCTCACCGGT GGCGCAGTCC TATTTGCGCA ATTTCCTCGC 180 

CGCACCGCCA CCTCAGCGCG CTGCCATGGC CGCGCAATTG CAAGCTGTGC CGGGGGCGGC 24 0 

ACAGTACATC GGCCTTGTCG AGTCGGTTGC CGGCTCCTGC AACAACTATT AAGCCCATGC 300 

GGGCCCCATC CCGCGACCCG GCATCGTCGC CGGGGCTAGG CCAGATTGCC CCGCTCCTCA 360 

ACGGGCCGCA TCCCGCGACC CGGCATCGTC GCCGGGGCTA GGCCAGATTG CCCCGCTCCT 42 0 

CAACGGGCCG CATCTCGTGC CGAATTCCTG CAGCCCGGGG GATCCACTAG TTCTAGAGCG 48 0 

GCCGCCACCG CGGTGGAGCT 500 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Val Ala Met Ser Leu Thr Val Gly Ala Gly Val Ala Ser Ala Asp Pro 

15 10 15 

Val Asp Ala Val He Asn Thr Thr Cys Asn Tyr Gly Gin Val Val Ala 

20 25 30 

Ala Leu Asn Ala Thr Asp Pro Gly Ala Ala Ala Gin Phe Asn Ala Ser 

35 40 45 

Pro Val Ala Gin Ser Tyr Leu Arg Asn Phe Leu Ala Ala Pro Pro Pro 

50 55 60 

Gin Arg Ala Ala Met Ala Ala Gin Leu Gin Ala Val Pro Gly Ala Ala 
65 70 75 80 

Gin Tyr He Gly Leu Val Glu Ser Val Ala Gly Ser Cys Asn Asn Tyr 
85 90 95 



105 110 
Ala Ala Gly Ser Ala Thr Ala Asp 
125 

Ser Ser Pro Val Thr Gin Asn Val 
140 

Met Leu Ser Arg Ala Ser Ala Met 
155 160 



(2) INFORMATION FOR SEQ ID NO: 103: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

ATGACAGAGC AGCAGTGGAA TTTCGCGGGT ATCGAGGCCG CGGCAAGCGC AATCCAGGGA 60 
AATGTCACGT CCATTCATTC CCTCCTTGAC GAGGGGAAGC AGTCCCTGAC CAAGCTCGCA 120 
GCGGCCTGGG GCGGTAGCGG TTCGGAAGCG TACC 154 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Met Thr Glu Gin Gin Trp Asn Phe Ala Gly lie Glu Ala Ala Ala Ser 

15 10 15 

Ala He Gin Gly Asn Val Thr Ser He His Ser Leu Leu Asp Glu Gly 

20 25 30 

Lys Gin Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser 

35 40 45 

Glu Ala Tyr 
50 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CGGTCGCGCA CTTCCAGGTG ACTATGAAAG TCGGCTTCCG NCTGGAGGAT TCCTGAACCT 60 

TCAAGCGCGG CCGATAACTG AGGTGCATCA TTAAGCGACT TTTCCAGAAC ATCCTGACGC 120 

GCTCGAAACG CGGCACAGCC GACGGTGGCT CCGNCGAGGC GCTGNCTCCA AAATCCCTGA 18 0 

GACAATTCGN CGGGGGCGCC TACAAGGAAG TCGGTGCTGA ATTCGNCGNG TATCTGGTCG 24 0 

ACCTGTGTGG TCTGNAGCCG GACGAAGCGG TGCTCGACGT CG 282 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

GATCGTACCC GTGCGAGTGC TCGGGCCGTT TGAGGATGGA GTGCACGTGT CTTTCGTGAT 60 

GG CAT AC C CA GAGATGTTGG CGGCGGCGGC TGACACCCTG CAGAGCATCG GTGCTACCAC 120 

TGTGGCTAGC AATGCCGCTG CGGCGGCCCC GACGACTGGG GTGGTGCCCC CCGCTGCCGA 180 

TGAGGTGTCG GCGCTGACTG CGGCGCACTT CGCCGCACAT GCGGCGATGT ATCAGTCCGT 240 

GAGCGCTCGG GCTGCTGCGA TTCATGACCA GTTCGTGGCC ACCCTTGCCA GCAGCGCCAG 3 00 

CTCGTATGCG GCCACTGAAG TCGCCAATGC GGCGGCGGCC AGCTAAGCCA GGAACAGTCG 3 60 

GCACGAGAAA CCACGAGAAA TAGGGACACG TAATGGTGGA TTTCGGGGCG TTACCACCGG 42 0 

AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC CTCGCTGGTG GCCGCGGCTC 48 0 

AGATGTGGGA CAGCGTGGCG AGTGAC CTGT TTTCGGCCGC GTCGGCGTTT CAGTCGGTGG 540 

TCTGGGGTCT GACGGTGGGG TCGTGGATAG GTTCGTCGGC GGGTCTGATG GTGGCGGCGG 600 

CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA GGCCGAGCTG ACCGCCGCCC 66 0 

AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG GCTGACGGTG CCCCCGCCGG 72 0 

TGATCGCCGA GAAC CGTGCT GAACTGATGA TTCTGATAGC GACCAACCTC TTGGGGCAAA 780 

ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGCGA GATGTGGGCC CAAGACGCCG 840 

CCGCGATGTT TGGCTACGCC GCGGCGACGG CGACGGCGAC GGCGACGTTG CTGCCGTTCG 900 

AGGAGGCGCC GGAGATGACC AGCGCGGGTG GGCTCCTCGA GCAGGCCGCC GCGGTCGAGG 960 

AGGCCTCCGA CACCGCCGCG GCGAACCAGT TGATGAACAA TGTGCCCCAG GCGCTGCAAC 1020 

AGCTGGCCCA GCCCACGCAG GGCACCACGC CTTCTTCCAA GCTGGGTGGC CTGTGGAAGA 1080 

CGGTCTCGCC GCATCGGTCG CCGATCAGCA ACATGGTGTC GATGGCCAAC AACCACATGT 1140 

CGATGACCAA CTCGGGTGTG TCGATGACCA ACACCTTGAG CTCGATGTTG AAGGGCTTTG 1200 

CTCCGGCGGC GGCCGCCCAG GCC GTGCAAA CCGCGGCGCA AAACGGGGTC CGGGCGATGA 1260 

GCTCGCTGGG CAGCTCGCTG GGTTCTTCGG GTCTGGGCGG TGGGGTGGCC GCCAACTTGG 1320 

GTCGGGCGGC CTCGGTCGGT TCGTTGTCGG TGCCGCAGGC CTGGGCCGCG GCCAACCAGG 138 0 

CAGTCACCCC GGCGGCGCGG GCGCTGCCGC TGACCAGCCT GACCAGCGCC GCGGAAAGAG 1440 

GGCCCGGGCA GATG CTGGGC GGGCTGCCGG TGGGGCAGAT GGGCGCCAGG GCCGGTGGTG 1500 

GGCTCAGTGG TGTGCTGCGT GTTCCGCCGC GACCCTATGT GATGCCGCAT TCTCCGGCGG 156 0 

CCGGCTAGGA GAGGGGGCGC AGACTGTCGT TATTTGACCA GTGATCGGCG GTCTCGGTGT 1620 

" V HUM I hi limn ii ni MM 1 1 
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TTCCGCGGCC GGCTATGACA ACAGTCAATG TGCATGACAA GTTACAGGTA TTAGGTC CAG 168 0 

GTTCAACAAG GAGACAGGCA ACATGGCCTC ACGTTTTATG ACGGATCCGC ACGCGATGCG 1740 

GGACATGGCG GGCCGTTTTG AGGTGCACGC CCAGACGGTG GAGGACGAGS CTCGCCGGAT 1800 

GTGGGCGTCC GCGCAAAACA TTTCCGGTGC GGGCTGGAGT GGCATGGCCG AGGCGACCTC 1860 

GCTAGACACC ATGGCCCAGA TGAATCAGGC GTTTCGCAAC ATCGTGAACA TGCTGCACGG 192 0 

GGTGCGTGAC GGGCTGGTTC GCGACGCCAA CAACTACGAG CAGCAAGAGC AGGCCTCCCA 1980 

GCAGATCCTC AG CAGCTAAC GTCAGCCGCT GCAGCACAAT ACTTTTACAA GCGAAGGAGA 2040 

ACAGGTTCGA TGACCATCAA CTATCAATTC GGGGATGTCG ACGCTCACGG CGCCATGATC 2100 

CGCGCTCAGG CCGGGTTGCT GGAGGCCGAG CATCAGGCCA TCATTCGTGA TGTGTTGACC 2160 

GCGAGTGACT TTTGGGGCGG CGCCGGTTCG GCGGCCTGCC AGGGGTTCAT TACCCAGTTG 2220 

GGCCGTAACT TCCAGGTGAT CTACGAGCAG GCCAACGCCC ACGGGCAGAA GGTGCAGGCT 2280 

GCCGGCAACA ACATGGCGCA AACCGACAGC GCCGTCGGCT CCAGCTGGGC CTGACACCAG 2340 

GCCAAGGCCA GGGACGTGGT GTACGAGTGA AGTTCCTCGC GTGATCCTTC GGGTGGCAGT 2400 

CTAAGTGGTC AGTGCTGGGG TGTTGGTGGT TTGCTGCTTG GCGGGTTCTT CGGTGCTGGT 246 0 

CAGTGCTGCT CGGGCTCGGG TGAGGACCTC GAGGCCCAGG TAGCGCCGTC CTTCGATCCA 2520 

TTCGTCGTGT TGTTCGGCGA GGACGGCTCC GACGAGGCGG ATGATCGAGG CGCGGTCGGG 2S80 

GAAGATGCCC ACGACGTCGG TTCGGCGTCG TACCTCTCGG TTGAGGCGTT CCTGGGGGTT 2640 

GTTGGACCAG ATTTGGCGCC AGATCTGCTT GGGGAAGGCG GTGAACGCCA GCAGGTCGGT 2700 

GCGGGCGGTG TCGAGGTGCT CGGCCACCGC GGGGAGTTTG TCGGTCAGAG CGTCGAGTAC 2760 

CCGATCATAT TGGGCAACAA CTGATTCGGC GTCGGGCTGG TCGTAGATGG AGTGCAGCAG 2820 

GGTGCGCACC CACGGCCAGG AGGGCTTCGG GGTGGCTGCC ATCAGATTGG CTGCGTAGTG 2880 

GGTTCTGCAG CGCTGCCAGG CCGCTGCGGG CAGGGTGGCG CCGATCGCGG CCACCAGGCC 2940 

GGCGTGGGCG TCGCTGGTGA CCAGCGCGAC CCCGGACAGG CCGCGGGCGA CCAGGTCGCG 3000 

GAAGAACGCC AGCCAGCCGG CCCCGTCCTC GGCGGAGGTG ACCTGGATGC CCAGGATC 3058 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



123 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Met Val Asp Phe Gly Ala Leu Pro Pro Glu He Asn Ser Ala Arg Met 
1 5 10 15 

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Gin Met Trp 
20 25 30 

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gin Ser 
35 40 45 

Val Val Trp Gly Leu Thr Val Gly Ser Trp He Gly Ser Ser Ala Gly 
50 55 60 

Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr 



Ala Gly Gin Ala Glu Leu Thr Ala Ala Gin Val Arg Val Ala Ala Ala 
85 90 95 

Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val He Ala 
100 105 110 

Glu Asn Arg Ala Glu Leu Met He Leu He Ala Thr Asn Leu Leu Gly 
115 120 125 

Gin Asn Thr Pro Ala He Ala Val Asn Glu Ala Glu Tyr Gly Glu Met 
130 135 140 

Trp Ala Gin Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Ala Thr Ala 
145 150 155 160 

Thr Ala Thr Ala Thr Leu Leu Pro Phe Glu Glu Ala Pro Glu Met Thr 
165 170 175 

Ser Ala Gly Gly Leu Leu Glu Gin Ala Ala Ala Val Glu Glu Ala Ser 
180 185 190 

Asp Thr Ala Ala Ala Asn Gin Leu Met Asn Asn Val Pro Gin Ala Leu 
195 200 205 

Gin Gin Leu Ala Gin Pro Thr Gin Gly Thr Thr Pro Ser Ser Lys Leu 
210 215 220 

Gly Gly Leu Trp Lys Thr Val Ser Pro His Arg Ser Pro He Ser Asn 
225 230 235 240 

Met Val Ser Met Ala Asn Asn His Met Ser Met Thr Asn Ser Gly Val 
245 250 255 

Ser Met Thr Asn Thr Leu Ser Ser Met Leu Lys Gly Phe Ala Pro Ala 
260 265 270 
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Ala Ala Ala Gin Ala Val Gin Thr Ala Ala Gin Asn Gly Val Arg Ala 
275 280 285 

Met Ser Ser Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu Gly Gly Gly 
290 295 300 

Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser Leu Ser Val 
305 310 315 320 

Pro Gin Ala Trp Ala Ala Ala Asn Gin Ala Val Thr Pro Ala Ala Arg 
325 330 335 

Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Glu Arg Gly Pro Gly 
340 345 350 

Gin Met Leu Gly Gly Leu Pro Val Gly Gin Met Gly Ala Arg Ala Gly 
355 360 365 

Gly Gly Leu Ser Gly Val Leu Arg Val Pro Pro Arg Pro Tyr Val Met 
370 375 380 

Pro His Ser Pro Ala Ala Gly 
385 390 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY:' linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

GACGTCAGCA CCCGCCGTGC AGGGCTGGAG CGTGGTCGGT TTTGATCTGC GGTCAAGGTG 60 

ACGTCCCTCG GCGTGTCGCC GGCGTGGATG CAGACTCGAT GCCGCTCTTT AGTGCAACTA 120 

ATTTCGTTGA AGTGCCTGCG AGGTATAGGA CTTCACGATT GGTTAATGTA GCGTTCACCC 180 

CGTGTTGGGG TCGATTTGGC CGGACCAGTC GTCAC CAACG CTTGGCGTGC GCGCCAGGCG 240 

GGCGATCAGA TCGCTTGACT AC C AATCAAT CTTGAGCTCC CGGGCCGATG CTCGGGCTAA 300 

ATGAGGAGGA GCACGCGTGT CTTTCACTGC GCAACCGGAG ATGTTGGCGG CCGCGGCTGG 360 

CGAACTTCGT TCCCTGGGGG CAACGCTGAA GGCTAGCAAT GCCGCCGCAG CCGTGCCGAC 420 

GACTGGGGTG GTGCCCCCGG CTGCCGACGA GGTGTCGCTG CTGCTTGCCA CACAATTCCG 480 

TACGCATGCG GCGACGTATC AGACGGCCAG CGCCAAGGCC GCGGTGATCC ATGAGCAGTT 540 

TGTGACCACG CTGGCCACCA GCGCTAGTTC ATATGCGGAC ACCGAGGCCG CCAACGCTGT 600 

GGTCACCGGC TAGCTGACCT GACGGTATTC GAGCGGAAGG ATTATCGAAG TGGTGGATTT 660 
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CGGGGCGTTA 


CCACCGGAGA TCAACTCCGC 


GAGGATGTAC 


GCCGGCCCGG GTTCGGCCTC 


720 


GCTGGTGGCC 


GCCGCGAAGA TGTGGGACAG 


CGTGGCGAGT 


GACCTGTTTT 


CGGCCGCGTC 


780 


GGCGTTTCAG 


TCGGTGGTCT GGGGTCTGAC 


GGTGGGGTCG TGGATAGGTT CGTCGGCGGG 


840 


TCTGATGGCG 


GCGGCGGCCT CGCCGTATGT 


GGCGTGGATG 


AGCGTCACCG 


CGGGGCAGGC 


900 


CCAGCTGACC 


GCCGCCCAGG TCCGGGTTGC 


TGCGGCGGCC 


TACGAGACAG 


CGTATAGGCT 


960 


GACGGTGCCC 


CCGCCGGTGA TCGCCGAGAA 


CCGTACCGAA 


CTGATGACGC 


TGACCGCGAC 


1020 


CAACCTCTTG 


GGGCAAAACA CGCCGGCGAT 


CGAGGCCAAT 


CAGGCCGCAT 


ACAGCCAGAT 


1080 


GTGGGGCCAA 


GACGCGGAGG CGATGTATGG 


CTACGCCGCC 


ACGGCGGCGA 


CGGCGACCGA 


1140 


GGCGTTGCTG 


CCGTTCGAGG ACGCCCCACT 


GATCACCAAC 


CCCGGCGGGC 


TCCTTGAGCA 


1200 


GGCCGTCGCG 


GTCGAGGAGG CCATCGACAC 


CGCCGCGGCG 


AACCAGTTGA 


TGAACAATGT 


1260 


GCCCCAAGCG 


CTGCAACAGC TGGCCCAGCC 


AGCGCAGGGC 


GTCGTACCTT 


CTTCCAAGCT 


1320 


GGGTGGGCTG 


TGGACGGCGG TCTCGCCGCA 


TCTGTCGCCG 


CTCAGCAACG 


TCAGTTCGAT 


1380 


AGCCAACAAC 


CACATGTCGA TGATGGGCAC 


GGGTGTGTCG 


ATG AC CAAC A 


CCTTGCACTC 


144 0 


GATGTTGAAG 


GGCTTAGCTC CGGCGGCGGC 


TCAGGCCGTG 


GAAACCGCGG 


CGGAAAACGG 


1500 


GGT CTGGGCG 


ATGAGCTCGC TGGGCAGCCA 


GCTGGGTTCG 


TCGCTGGGTT 


CTTCGGGTCT 


1560 


GGGCGCTGGG 


GTGGCCGCCA ACTTGGGTCG 


GGCGGCCTCG 


GTCGGTTCGT 


TGTCGGTGCC 


1620 


GCCAGCATGG 


GCCGCGGCCA ACCAGGCGGT 


CACCCCGGCG 


GCGCGGGCGC 


TGCCGCTGAC 


1680 


CAGCCTGACC 


AGCGCCGCCC AAACCGCCCC 


CGGACACATG 


CTGGG 




1725 


(2) INFORMATION FOR SEQ ID NO: 109: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Val Val Asp Phe Gly Ala Leu Pro Pro Glu lie Asn Ser Ala Arg Met 
15 10 15 

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp 
20 25 30 

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gin Ser 
35 40 45 
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Val Val Trp Gly Leu Thr Val Gly Ser Trp lie Gly Ser Ser Ala Gly 



Leu Met Ala Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr 
65 70 75 ' 80 

Ala Gly Gin Ala Gin Leu Thr Ala Ala Gin Val Arg Val Ala Ala Ala 



Ala Tyr Glu Thr Ala Tyr Arg Leu Thr Val Pro Pro Pro Val lie Ala 
100 105 110 

Glu Asn Arg Thr Glu Leu Met Thr Leu Thr Ala Thr Asn Leu Leu Gly 
115 120 125 

Gin Asn Thr Pro Ala lie Glu Ala Asn Gin Ala Ala Tyr Ser Gin Met 
130 135 140 

Trp Gly Gin Asp Ala Glu Ala Met Tyr Gly Tyr Ala Ala Thr Ala Ala 
145 150 155 160 

Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu lie Thr 
165 170 175 

Asn Pro Gly Gly Leu Leu Glu Gin Ala Val Ala Val Glu Glu Ala lie 
180 185 190 

Asp Thr Ala Ala Ala Asn Gin Leu Met Asn Asn Val Pro Gin Ala Leu 
195 200 205 

Gin Gin Leu Ala Gin Pro Ala Gin Gly Val Val Pro Ser Ser Lys Leu 
210 215 220 

Gly Gly Leu Trp Thr Ala Val Ser Pro His Leu Ser Pro Leu Ser Asn 
225 230 235 240 

Val Ser Ser lie Ala Asn Asn His Met Ser Met Met Gly Thr Gly Val 
245 250 255 

Ser Met Thr Asn Thr Leu His Ser Met Leu Lys Gly Leu Ala Pro Ala 
260 265 270 

Ala Ala Gin Ala Val Glu Thr Ala Ala Glu Asn Gly Val Trp Ala Met 
275 280 285 

Ser Ser Leu Gly Ser Gin Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu 
290 295 300 

Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser 
305 310 315 320 



Leu Ser Val Pro Pro Ala Trp Ala Ala Ala Asn Gin Ala Val Thr Pro 
325 330 335 
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Ala Ala Arg Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Gin Thr 
340 345 350 

Ala Pro Gly His Met Leu Gly 
355 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



AGTTCAGTCG AGAATGATAC 


TGACGGGCTG 


TATCCACGAT 


GGCTGAGACA ACCGAACCAC 


60 


CGTCGGACGC GGGGACATCG 


CAAGCCGACG 


CGATGGCGTT 


GGCCGCCGAA GCCGAAGCCG 


120 


CCGAAGCCGA 


AGCGCTGGCC 


GCCGCGGCGC 


GGGCCCGTGC 


CCGTGCCGCC CGGTTGAAGC 


180 


GTGAGGCGCT 


GGCGATGGCC 


CCAGCCGAGG 


ACGAGAACGT 


CCCCGAGGAT ATGCAGACTG 


240 


GGAAGACGCC 


GAAGACTATG 


ACGACTATGA 


CGACTATGAG 


GCCGCAGACC AGGAGGCCGC 


300 


ACGGTCGGCA 


TCCTGGCGAC 


GGCGGTTGCG 


GGTGCGGTTA 


CCAAGACTGT CCACGATTGC 


360 


CATGGCGGCC 


GCAGTCGTCA 


TCATCTGCGG 


CTTCACCGGG 


CTCAGCGGAT ACATTGTGTG 


420 


GCAACACCAT 


GAGGCCACCG 


AACGCCAGCA 


GCGCGCCGCG 


GCGTTCGCCG CCGGAGCCAA 


480 


GCAAGGTGTC 


ATCAACATGA 


CCTCGCTGGA 


CTTCAACAAG 


GCCAAAGAAG ACGTCGCGCG 


540 


TGTGATCGAC 


AGCTCCACCG 


GCGAATTCAG 


GGATGACTTC 


CAGCAGCGGG CAGCCGATTT 


600 


CACCAAGGTT 


GTCGAACAGT 


CCAAAGTGGT 


CACCGAAGGC 


ACGGTGAACG CGACAGCCGT 


660 


CGAATC CATG 


AACGAGCATT 


CCGCCGTGGT 


GCTCGTCGCG 


GCGACTTCAC GGGTCACCAA 


720 


TTCCGCTGGG 


GCGAAAGACG 


AACCACGTGC 


GTGGCGGCTC 


AAAGTGACCG TGACCGAAGA 


780 


GGGGGGACAG 


TACAAGATGT 


CGAAAGTTGA 


GTTCGTACCG 


TGACCGATGA CGTACGCGAC 


840 


GTCAACACCG 


AAACCACTGA 


CGCCACCGAA 


GTCGCTGAGA 


TCGACTCAGC CGCAGGCGAA 


900 


GCCGGTGATT 


CGGCGACCGA 


GGCATTTGAC 


ACCGACTCTG 


CAACGGAATC TACCGCGCAG 


960 


AAGGGTCAGC 


GGCACCGTGA 


CCTGTGGCGA 


ATGCAGGTTA 


CCTTGAAACC CGTTCCGGTG 


1020 


ATTCTCATCC 


TGCTCATGTT 


GATCTCTGGG 


GGCGCGACGG 


GATGGCTATA CCTTGAGCAA 


1080 


TACGACCCGA 


TCAGCAGACG 


GACTCCGGCG 


CCGCCCGTGC 


TGCCGTCGCC GCGGCGTCTG 


1140 


ACGGGACAAT 


CGCGCTGTTG 


TGTATTCACC 


CGACACGTCG 


ACCAAGACTT CGCTACCGCC 


1200 
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AGGTCGCACC TCGCCGGCGA TTTCCTGTCC TATACGACCA GTTCACGCAG CAGATCGTGG 1260 

CTCCGGCGGC CAAACAGAAG TCACTGAAAA CCACCGCCAA GGTGGTGCGC GCGGCCGTGT 1320 

CGGAGCTACA TCCGGATTCG GCCGTCGTTC TGGTTTTTGT CGACCAGAGC ACTACCAGTA 13 80 

AGGACAGCCC CAATCCGTCG ATGGCGGCCA GCAGCGTGAT GGTGACCCTA GCCAAGGTCG 1440 

ACGGCAATTG GCTGATCACC AAGTTCACCC CGGTTTAGGT TGCCGTAGGC GGTCGCCAAG 150 0 

TCTGACGGGG GCGCGGGTGG CTGCTCGTGC GAGATACCGG CCGTTCTCCG GACAATCACG 1560 

GCCCGACCTC AAACAGATCT CGGCCGCTGT CTAATCGGCC GGGTTATTTA AGATTAGTTG 1620 

CCACTGTATT TACCTGATGT TCAGATTGTT CAGCTGGATT TAGCTTCGCG GCAGGGCGGC 168 0 

TGGTGCACTT TGCATCTGGG GTTGTGACTA CTTGAGAGAA TTTGACCTGT TGCCGACGTT 174 0 

GTTTGCTGTC CATCATTGGT GCTAGTTATG GCCGAGCGGA AGGATTATCG AAGTGGTGGA 180 0 

CTTCGGGGCG TTACCACCGG AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC 1860 

CTCGCTGGTG GCCGCCGCGA AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC 1920 

GTCGGCGTTT CAGTCGGTGG TCTGGGGTCT GACGACGGGA TCGTGGATAG GTTCGTCGGC 1980 

GGGTCTGATG GTGGCGGCGG CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA 2040 

GGCCGAGCTG ACCGCCGCCC AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG 2100 

GCTGACGGTG CCCCCGCCGG TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC 2160 

GACCAACCTC TTGGGGCAAA ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGGGA 2220 
GATGTGGGCC CAAGACGCCG CCGCGATGTT TGGCTACGCC GCCACGGCGG CGACGGCGAC 2280 
CGAGGCGTTG CTGCCGTTCG AGGACGCCCC ACTGATCACC AACCCCGGCG GGCTCCTTGA 2340 
GCAGGCCGTC GCGGTCGAGG AGGCCATCGA CACCGCCGCG GCGAACCAGT TGATGAACAA 2400 
TGTGCCCCAA GCGCTGCAAC AACTGGCCCA GCCCACGAAA AGCATCTGGC CGTTCGACCA 2460 
ACTGAGTGAA CTCTGGAAAG CCATCTCGCC GCATCTGTCG CCGCTCAGCA ACATCGTGTC 2520 
GATGCTCAAC AACCACGTGT CGATGACCAA CTCGGGTGTG TCGATGGCCA GCACCTTGCA 2580 
CTCAATGTTG AAGGGCTTTG CTCCGGCGGC GGCTCAGGCC GTGGAAACCG CGGCGCAAAA 2640 
CGGGGTCCAG GCGATGAGCT CGCTGGGCAG CCAGCTGGGT TCGTCGCTGG GTTCTTCGGG 2700 
TCTGGGCGCT GGGGTGGCCG CCAACTTGGG TCGGGCGGCC TCGGTCGGTT CGTTGTCGGT 276 0 

GCCGCAGGCC TGGGCCGCGG CCAACCAGGC GGTCACCCCG GCGGCGCGGG CGCTGCCGCT 2820 
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GACCAGCCTG ACCAGCGCCG CCCAAACCGC CCCCGGACAC ATGCTGGGCG GGCTACCGCT 2880 

GGGGCAACTG ACCAATAGCG GCGGCGGGTT CGGCGGGGTT AGCAATGCGT TGCGGATGCC 2 940 

GCCGCGGGCG TACGTAATGC CCCGTGTGCC CGCCGCCGGG TAACGCCGAT CCGCACGCAA 3 000 

TGCGGGCCCT CTATGCGGGC AGCGATC 3 027 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Val Val Asp Phe Gly Ala Leu Pro Pro Glu He Asn Ser Ala Arg Met 
15 10 15 

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp 
20 25 30 

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gin Ser 
35 40 45 

Val Val Trp Gly Leu Thr Thr Gly Ser Trp lie Gly Ser Ser Ala Gly 
50 55 60 

Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr 
65 70 75 80 

Ala Gly Gin Ala Glu Leu Thr Ala Ala Gin Val Arg Val Ala Ala Ala 
85 90 95 

Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val He Ala 
100 105 HO 

Glu Asn Arg Ala Glu Leu Met He Leu He Ala Thr Asn Leu Leu Gly 

115 120 125 

Gin Asn Thr Pro Ala He Ala Val Asn Glu Ala Glu Tyr Gly Glu Met 
130 135 140 

Trp Ala Gin Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Thr Ala Ala 
145 150 155 160 

Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu He Thr 
165 170 175 

Asn Pro Gly Gly Leu Leu Glu Gin Ala Val Ala Val Glu Glu Ala He 
180 185 190 

Asp Thr Ala Ala Ala Asn Gin Leu Met Asn Asn Val Pro Gin Ala Leu 
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Gin Gin Leu Ala Gin Pro Thr Lys Ser lie Trp Pro Phe Asp Gin Leu 
210 215 220 

Ser Glu Leu Trp Lys Ala lie Ser Pro His Leu Ser Pro Leu Ser Asn 
225 230 235 240 

lie Val Ser Met Leu Asn Asn His Val Ser Met Thr Asn Ser Gly Val 
245 250 255 

Ser Met Ala Ser Thr Leu His Ser Met Leu Lys Gly Phe Ala Pro Ala 
260 265 270 

Ala Ala Gin Ala Val Glu Thr Ala Ala Gin Asn Gly Val Gin Ala Met 
275 280 285 

Ser Ser Leu Gly Ser Gin Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu 
290 295 300 

Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser 
305 310 315 320 

Leu Ser Val Pro Gin Ala Trp Ala Ala Ala Asn Gin Ala Val Thr Pro 
325 330 335 

Ala Ala Arg Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Gin Thr 
340 345 350 

Ala Pro Gly His Met Leu Gly Gly Leu Pro Leu Gly Gin Leu Thr Asn 
355 360 365 

Ser Gly Gly Gly Phe Gly Gly Val Ser Asn Ala Leu Arg Met Pro Pro 
370 375 380 

Arg Ala Tyr Val Met Pro Arg Val Pro Ala Ala Gly 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CAT CGGAGGG AGTGATCACC ATGCTGTGGC ACGCAATGCC ACCGGAGTAA ATAC CGCACG 60 

GCTGATGGCC GGCGCGGGTC CGGCTCCAAT GCTTGCGGCG GCCG CGGGAT GGCAGACGCT 120 

TTCGGCGGCT CTGGACGCTC AGGCCGTCGA GTTGACCGCG CGCCTGAACT CTCTGGGAGA 180 

AGCCTGGACT GGAGGTGGCA GCGACAAGGC GCTTGCGGCT GCAACGCCGA TGGTGGTCTG 240 
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GCTACAAACC GCGTCAACAC AGGCCAAGAC CCGTGCGATG CAGGCGACGG CGCAAGCCGC 300 

GGCATACACC CAGGCCATGG CCACGACGCC GTCGCTGCCG GAGATCGCCG CCAACCACAT 360 

CACCCAGGCC GTCCTTACGG CCACCAACTT CTTCGGTATC AACACGATCC CGATCGCGTT 420 

GACCGAGATG GATTATTTCA TCCGTATGTG GAACCAGGCA GCCCTGGCAA TGGAGGTCTA 480 

CCAGGCCGAG ACCGCGGTTA ACACGCTTTT CGAGAAGCTC GAGCCGATGG CGTCGATCCT 540 

TGATCCCGGC GCGAGCCAGA GCACGACGAA CCCGATCTTC GGAATGCCCT CCCCTGGCAG 600 

CTCAACACCG GTTGGCCAGT TGCCGCCGGC GGCTACCCAG ACCCTCGGCC AACTGGGTGA 660 

GATGAGCGGC CCGATGCAGC AGCTGACCCA GCCGCTGCAG CAGGTGACGT CGTTGTTCAG 720 

CCAGGTGGGC GGCACCGGCG GCGGCAACCC AGCCGAC GAG GAAGCCGCGC AGATGGGCCT 780 

GCTCGGCACC AGTCCGCTGT CGAACCATCC GCTGGCTGGT GGATCAGGCC CCAGCGCGGG 840 

CGCGGGCCTG CTGCGCGCGG AGTCGCTACC TGGCGCAGGT GGGTCGTTGA CCCGCACGCC 900 

GCTGATGTCT CAGCTGATCG AAAAGCCGGT TGCCCCCTCG GTGATGCCGG CGGCTGCTGC 960 

CGGATCGTCG GCGACGGGTG GCGCCGCTCC GGTGGGTGCG GGAGCGATGG GCCAGGGTGC 1020 

GCAATCCGGC GGCTCCACCA GGCCGGGTCT GGTCGCGCCG GCACCGCTCG CGCAGGAGCG 1080 

TGAAGAAGAC GACGAGGACG ACTGGGACGA AGAGGACGAC TGGTGAGCTC CCGTAATGAC 1140 

AACAGACTTC CCGGCCACCC GGGCCGGAAG ACTTGC CAAC ATTTTGGCGA GGAAGGTAAA 1200 

GAGAGAAAGT AGTCCAGCAT GGCAGAGATG AAGACCGATG CCGCTACCCT CGCGCAGGAG 1260 

GCAGGTAATT TCGAGCGGAT CTCCGGCGAC CTGAAAACCC AGATCGACCA GGTGGAGTCG 1320 

ACGGCAGGTT CGTTGCAGGG CCAGTGGCGC GGCGCGGCGG GGACGGCCGC CCAGGCCGCG 1380 

GTGGTGCGCT TCCAAGAAGC AGCCAATAAG CAGAAGCAGG AACTCGACGA GATCTCGACG 1440 

AATATTCGTC AGGCCGGCGT CCAATACTCG AGGGCCGACG AGGAGCAGCA GCAGGCGCTG 1500 

TCCTCGCAAA TGGGCTTCTG ACCCGCTAAT ACGAAAAGAA ACGGAGCAAA AACATGACAG 1560 

AGCAGCAGTG GAATTTCGCG GGTATCGAGG CCGCGGCAAG CGCAATCCAG GGAAAT 1616 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

CTAGTGGATG GGACCATGGC CATTTTCTGC AGTCTCACTG CCTTCTGTGT TGACATTTTG 6 0 

GCACGCCGGC GGAAACGAAG CACTGGGGTC GAAGAACGGC TGCGCTGCCA TATCGTCCGG 120 

AGCTTCCATA CCTTCGTGCG GCCGGAAGAG CTTGTCGTAG TCGGCCGCCA TGACAACCTC 18 0 

TCAGAGTGCG CTCAAACGTA TAAACACGAG AAAGGGCGAG ACCGACGGAA GGTCGAACTC 24 0 

GCCCGATCCC GTGTTTCGCT ATTCTACGCG AACTCGGCGT TGCCCTATGC GAACATCCCA 300 

GTGACGTTGC CTTCGGT CGA AGCCATTGCC TGACCGGCTT CGCTGATCGT CCGCGCCAGG 360 

TTCTGCAGCG CGTTGTTCAG CTCGGTAGCC GTGGCGTCCC ATTTTTGCTG GACACCCTGG 420 

TACGCCTCCG AA 432 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO -.114: 

Met Leu Trp His Ala Met Pro Pro Glu Xaa Asn Thr Ala Arg Leu Met 
15 10 15 

Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gin 
20 25 30 

Thr Leu Ser Ala Ala Leu Asp Ala Gin Ala Val Glu Leu Thr Ala Arg 
35 40 45 

Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala 
50 55 60 

Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gin Thr Ala Ser Thr 
65 70 75 80 

Gin Ala Lys Thr Arg Ala Met Gin Ala Thr Ala Gin Ala Ala Ala Tyr 

85 90 95 

Thr Gin Ala Met Ala Thr Thr Pro Ser Leu Pro Glu lie Ala Ala Asn 
100 105 HO 

His He Thr Gin Ala Val Leu Thr Ala Thr Asn Phe Phe Gly He Asn 
115 120 125 

Thr He Pro He Ala Leu Thr Glu Met Asp Tyr Phe He Arg Met Trp 
130 135 140 
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Asn Gin Ala Ala Leu Ala Met Glu Val Tyr Gin Ala Glu Thr Ala Val 
145 150 155 160 

Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser lie Leu Asp Pro 
165 170 175 

Gly Ala Ser Gin Ser Thr Thr Asn Pro He Phe Gly Met Pro Ser Pro 
180 185 190 

Gly Ser Ser Thr Pro Val Gly Gin Leu Pro Pro Ala Ala Thr Gin Thr 
195 200 205 

Leu Gly Gin Leu Gly Glu Met Ser Gly Pro Met Gin Gin Leu Thr Gin 
210 215 220 

Pro Leu Gin Gin Val Thr Ser Leu Phe Ser Gin Val Gly Gly Thr Gly 
225 230 235 240 

Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gin Met Gly Leu Leu Gly 
245 250 255 

Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser 
260 265 270 

Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly 
275 280 285 

Ser Leu Thr Arg Thr Pro Leu Met Ser Gin Leu He Glu Lys Pro Val 
290 295 300 

Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly 
305 310 315 320 

Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gin Gly Ala Gin Ser 
325 330 335 

Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro Leu Ala Gin 
340 345 350 

Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp 
355 360 365 



INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly 
15 io 15 
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Asn Phe Glu Arg lie Ser Gly Asp Leu Lys Thr Gin lie Asp Gin Val 

20 25 30 

Glu Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly 
35 40 "45 

Thr Ala Ala Gin Ala Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys 
50 55 60 

Gin Lys Gin Glu Leu Asp Glu He Ser Thr Asn He Arg Gin Ala Gly 
65 70 75 80 

Val Gin Tyr Ser Arg Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser 

85 90 95 

Gin Met Gly Phe 
100 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE : nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

GATCTCCGGC GACCTGAAAA CCCAGATCGA CCAGGTGGAG TCGACGGCAG GTTCGTTGCA 60 

GGGCCAGTGG CGCGGCGCGG CGGGGAC GGC CGCCCAGGCC GCGGTGGTGC GCTTCCAAGA 120 

AGCAGCCAAT AAGCAGAAGC AGGAACTCGA CGAGATCTCG ACGAATATTC GTCAGGCCGG 180 

CGTCCAATAC TCGAGGGCCG ACGAGGAGCA GCAGCAGGCG CTGTCCTCGC AAATGGGCTT 240 

CTGACCCGCT AATACGAAAA GAAACGGAGC AAAAACATGA CAGAGCAGCA GTGGAATTTC 300 

GCGGGTATCG AGGCCGCGGC AAGCGCAATC CAGGGAAATG TCACGTCCAT TCATTCCCTC 360 

CTTGACGAGG GGAAGCAGTC CCTGACCAAG CTCGCA 3 96. 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



He Ser Gly Asp Leu Lys Thr Gin lie Asp Gin Val Glu Ser Thr Ala 
15 10 15 
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Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin 
20 25 30 

Ala Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu 
3 5 40 45 

Leu Asp Glu lie Ser Thr Asn lie Arg Gin Ala Gly Val Gin Tyr Ser 
50 55 60 

Arg Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

GTGGATCCCG ATCCCGTGTT TCGCTATTCT ACGCGAACTC GGCGTTGCCC TATGCGAACA 60 

TCCCAGTGAC GTTGC CTTCG GTCGAAGCCA TTGCCTGACC GGCTTCGCTG ATCGTCCGCG 120 

CCAGGTTCTG CAGCGCGTTG TTCAGCTCGG TAGCCGTGGC GTCCCATTTT TGCTGGACAC 180 

CCTGGTACGC CTCCGAACCG CTACCGCCCC AGGCCGCTGC GAGCTTGGTC AGGGACTGCT 240 

TCCCCTCGTC AAGGAGGGAA TGAATGGACG TGACATTTCC CTGGATTGCG CTTGCCGCGG 300 

CCTCGATACC CGCGAAATTC CACTGCTGCT CTGTCATGTT TTTGCTCCGT T T CT T TTCGT 360 

ATTAGCGGGT CAGAAGCCCA TTTGCGA 387 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

CGGCACGAGG ATCTCGGTTG GCCCAACGGC GCTGGCGAGG GCTCCGTTCC GGGGGCGAGC 60 

TGCGCGCCGG ATGCTTCCTC TGCCCGCAGC CGCGC CTGGA TGGATGGACC AGTTGCTACC 120 

TTCCCGACGT TTCGTTCGGT GTCTGTGCGA TAGCGGTGAC CCCGGCGCGC ACGTCGGGAG 18 0 

TGTTGGGGGG CAGGCCGGGT CGGTGGTTCG GCCGGGGACG CAGACGGTCT GGACGGAACG 240 
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GGCGGGGGTT CGCCGATTGG CATCTTTGCC CA 
(2) INFORMATION FOR SEQ ID NO: 12 0: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Asp Pro Val Asp Ala Val lie Asn Thr Thr Cys Asn Tyr Gly Gin Val 
1 5 10 15 

Val Ala Ala Leu 

20 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser 
1 5 ■ 10 15 

(2) INFORMATION FOR SEQ ID NO: 12 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys 
1 5 10 15 

Glu Gly Arg 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Tyr Tyr Trp Cys Pro Gly Gin Pro Phe Asp Pro Ala Trp Gly Pro 
15 10 is 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Asp lie Gly Ser Glu Ser Thr Glu Asp Gin Gin Xaa Ala Val 

15 10 

(2) INFORMATION FOR SEQ ID NO: 12 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

Ala Glu Glu Ser lie Ser Thr Xaa Glu Xaa lie Val Pro 
15 10 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro 
15 10 15 

Ser 



(2) INFORMATION FOR SEQ ID NO: 127 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



138 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly 
1 5 10 15 

INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 

Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gin Leu Thr Ser 
1 5 10 15 

Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala Asn 

20 25 30 

INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Asp Pro Pro Asp Pro His Gin Xaa Asp Met Thr Lys Gly Tyr Tyr Pro 
1 5 10 15 

Gly Gly Arg Arg Xaa Phe 
20 

INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Pro Gly Tyr Thr Pro Gly 
1 5 

INFORMATION FOR SEQ ID NO: 131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "The Second Residue Can Be Either a 
Pro or Thr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Xaa Xaa Gly Phe Thr Gly Pro Gin Phe Tyr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 132: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "The Third Residue Can Be Either a 

Gin or Leu" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Xaa Pro Xaa Val Thr Ala Tyr Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 

Xaa Xaa Xaa Glu Lys Pro Phe Leu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: 



Xaa Asp Ser Glu Lys Ser Ala Thr lie Lys Val Thr Asp Ala Ser 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 135: 

Ala Gly Asp Thr Xaa lie Tyr He Val Gly Asn Leu Thr Ala Asp 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Ala Pro Glu Ser Gly Ala Gly Leu Gly Gly Thr Val Gin Ala Gly 

1-5 10 15 

(2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 7: 

Xaa Tyr lie Ala Tyr Xaa Thr Thr Ala Gly He Val Pro Gly Lys He 
15 10 15 

Asn Val His Leu Val 
20 

(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 
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GCAACGCTGT CGTGGCCTTT GCGGTGATCG GTTTCGCCTC GCTGGCGGTG GCGGTGGCGG 60 

TCACCATCCG ACCGACCGCG GCCTCAAAAC CGGTAGAGGG ACACCAAAAC GCCCAGCCAG 120 

GGAAGTTCAT GCCGTTGTTG CCGACGCAAC AGCAGGCGCC GGTCCCGCCG CCTCCGCCCG 180 

ATGATCCCAC CGCTGGATTC CAGGGCGGCA CCATTCCGGC TGTACAGAAC GTGGTGCCGC 240 

GGCCGGGTAC CTCACCCGGG GTGGGTGGGA CGCCGGCTTC GCCTGCGCCG GAAGCGCCGG 300 

CCGTGCCCGG TGTTGTGCCT GCCCCGGTGC CAATCCCGGT CCCGATCATC ATTCCCCCGT 36 0 

TCCCGGGTTG GCAGCCTGGA ATGCCGACCA TCCCCACCGC ACCGCCGACG ACGCCGGTGA 420 

CCACGTCGGC GACGACGCCG CCGACCACGC CGCCGACCAC GCCGGTGACC ACGCCGCCAA 48 0 

CGACGCCGCC GACCACGCCG GTGACCACGC CGCCAACGAC GCCGCCGACC ACGCCGGTGA 540 

CCACGCCACC AACGACCGTC GCCCCGACGA CCGTCGCCCC GACGACGGTC GCTCCGACCA 600 

CCGTCGCCCC GACGACGGTC GCTCCAGCCA CCGCCACGCC GACGAC CGTC GCTCCGCAGC 660 

CGACG CAGCA GCCCACGCAA CAACCAACCC AACAGATGCC AACCCAGCAG CAGACCGTGG 720 

CCCCGCAGAC GGTGGCGCCG GCTCCGCAGC CGCCGTCCGG TGGCCGCAAC GGCAGCGGCG 780 

GGGGCGACTT ATTCGGCGGG TTCTGATCAC GGTCGCGGCT TCACTACGGT CGGAGGACAT 840 

GGCCGGTGAT GCGGTGACGG TGGTGCTGCC CTGTCTCAAC GA 882 
(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

CCATCAACCA ACCGCTCGCG CCGCCCGCGC CGCCGGATCC GCCGTCGCCG CCACGCCCGC 60 

CGGTGCCTCC GGTGCCCCCG TTGCCGCCGT CGCCGCCGTC GCCGCCGACC GGCTGGGTGC 120 

CTAGGGCGCT GTTACCGCCC TGGTTGGCGG GGACGCCGCC GGCACCACCG GTACCGCCGA 18 0 

TGGCGCCGTT GCCGCCGGCG GCACCGTTGC CACCGTTGCC ACCGTTGCCA CCGTTGCCGA 240 

CCAGCCACCC GCCGCGACCA CCGGCACCGC CGGCGCCGCC CGCACCGCCG GCGTGCCCGT 3 00 

TCGTGCCCGT ACCGCCGGCA CCGCCGTTGC CGCCGTCACC GCCGACGGAA CTACCGGCGG 360 

ACGCGGCCTG CCCGCCGGCG CCGCCCGCAC CGCCATTGGC ACCGCCGTCA CCGCCGGCTG 420 
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GGAGTGCCGC GATTAGGGCA CTGACCGGCG CAACCAGCGC AAGTACTCTC GGTCACCGAG 480 

CACTTCCAGA CGACACCACA GCACGGGGTT GTCGGCGGAC TGGGTGAAAT GGCAGCCGAT 540 

AGCGGCTAGC TGTCGGCTGC GGTCAACCTC GATCATGATG TCGAGGTGAC CGTGACCGCG 600 

CCCCCCGAAG GAGGCGCTGA ACTCGGCGTT GAG CCGATCG GCGATCGGTT GGGGCAGTGC 660 

CCAGGCCAAT ACGGGGATAC CGGGTGTCNA AGCCGCCGCG AGCGCAGCTT CGGTTGCGCG 72 0 

ACNGTGGTCG GGGTGGCCTG TTACGCCGTT GTCNTCGAAC ACGAGTAGCA GGTCTGCTCC 78 0 

GGCGAGGGCA TCCACCACGC GTTGCGTCAG CTCGT 815 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

ACCAGCCGCC GGCTGAGGTC TCAGATCAGA GAGTCTCCGG ACTCACCGGG GCGGTTCAGC 60 

CTTCTCCCAG AACAACTGCT GAAGATC CTC GCCCGCGAAA GAGGCGCTGA TTTGACGCTC 120 

TATGACCGGT TGAACGACGA GATCATCCGG CAGATTGATA TGGCACCGCT GGGCTAACAG 180 

GTGCGCAAGA TGGTGCAGCT GTATGTCTCG GACTCCGTGT CGCGGATCAG CTTTGCCGAC 240 

GGCCGGGTGA TCGTGTGGAG CGAGGAGCTC GGCGAGAGCC AGTATCCGAT CGAGACGCTG 300 

GACGGCATCA CGCTGTTTGG GCGGCCGACG ATGACAACGC CCTTCATCGT TGAGATGCTC 360 

AAGCGTGAGC GCGACATCCA GCTCTTCACG ACCGACGGCC ACTACCAGGG CCGGATCTCA 420 

ACACCCGACG TGTCATACGC GCCGCGGCTC CGTCAGCAAG TTCACCGCAC CGACGATCCT 480 

GCGTTCTGCC TGTCGTTAAG CAAGCGGATC GTGTCGAGGA AGATCCTGAA TCAGCAGGCC 540 

TTGATTCGGG CACACACGTC GGGGCAAGAC GTTGCTGAGA GCATCCGCAC GATGAAGCAC 600 

TCGCTGGCCT GGGTCGATCG ATCGGGCTCC CTGG CGGAGT TGAACGGGTT CGAGGGAAAT 660 

GCCGCAAAGG CATACTTCAC CGCGCTGGGG CATCTCGTCC CGCAGGAGTT CGCATTCCAG 720 

GGCCGCTCGA CTCGGCCGCC GTTGGACGCC TTCAACTCGA TGGTCAGCCT CGGCTATTCG 78 0 

CTGCTGTACA AGAACATCAT AGGGGCGATC GAGCGTCACA GCCTGAACGC GTATATCGGT 840 
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TTCCTACACC AGGATTCACG AGGGCACGCA ACGTCTCGTG CCGAATTCGG CACGAGCTCC 900 

GCTGAAACCG CTGGCCGGCT GCTCAGTGCC CGTACGTAAT CCGCTGCGCC CAGGCCGGCC 960 

CGCCGGCCGA ATACCAGCAG ATCGGACAGC GAATTGCCGC CCAGCCGGTT GGAGCCGTGC 1020 

ATACCGCCGG CACACTCACC GGCAGCGAAC AGGCCTGGCA CCGTGGCGGC GCCGGTGTCC 108 0 

GCGTCTACTT CGACACCGCC CATCACGTAG TGACACGTCG GCCCGACTTC CATTGC CTGC 114 0 

GTTCGGCACG AG 1152 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

CTCGTGCCGA TTCGGCAGGG TGTACTTGCC GGTGGTGTAN GCCGCATGAG TGCCGACGAC 60 

CAGCAATGCG GCAACAGCAC GGATCCCGGT CAACGACGCC ACCCGGTCCA CGTGGGCGAT 12 0 

CCGCTCGAGT CCGCCCTGGG CGGCTCTTTC CTTGGGCAGG GTCATCCGAC GTGTTTCCGC 18 0 

CGTGGTTTGC CGCCATTATG CCGGCGCGCC GCGTCGGGCG GCCGGTATGG CCGAANGTCG 24 0 

ATCAGCACAC CCGAGATACG GGTCTGTGCA AGCTTTTTGA GCGTCGCGCG GGGCAGCTTC 300 

GCCGGCAATT CTACTAGCGA GAAGTCTGGC CCGATACGGA TCTGACCGAA GTCGCTGCGG 360 

TGCAGCCCAC CCTCATTGGC GATGGCGCCG ACGATGGCGC CTGGACCGAT CTTGTGCCGC 420 

TTGCCGACGG CGACGCGGTA GGTGGTCAAG TCCGGTCTAC GCTTGGGCCT TTGCGGACGG 480 

TCCCGACGCT GGTCGCGGTT GCGC CGCGAA AGCGGCGGGT CGGGTGC CAT CAGGAATGCC 540 

TCACCGCCGC GGCACTGCAC GGCCAGTGCC GCGGCGATGT CAGCCATCGG GACATCATGC 600 

TCGCGTTCAT ACTCCTCGAC CAGTCGGCGG AACAGCTCGA TTCCCGGACC GCCCA 655 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Asn Ala Val Val Ala Phe Ala Val He Gly Phe Ala Ser Leu Ala Val 
15 10 15 

Ala Val Ala Val Thr He Arg Pro Thr Ala Ala Ser Lys Pro Val Glu 
20 25 30 

Gly His Gin Asn Ala Gin Pro Gly Lys Phe Met Pro Leu Leu Pro Thr 
35 40 45 

Gin Gin Gin Ala Pro Val Pro Pro Pro Pro Pro Asp Asp Pro Thr Ala 
50 55 60 

Gly Phe Gin Gly Gly Thr He Pro Ala Val Gin Asn Val Val Pro Arg 



Pro Gly Thr Ser Pro Gly Val Gly Gly Thr Pro Ala Ser Pro Ala Pro 
85 90 95 

Glu Ala Pro Ala Val Pro Gly Val Val Pro Ala Pro Val Pro He Pro 
100 105 110 

Val Pro He He He Pro Pro Phe Pro Gly Trp Gin Pro Gly Met Pro 
115 120 125 

Thr He Pro Thr Ala Pro Pro Thr Thr Pro Val Thr Thr Ser Ala Thr 
130 135 140 

Thr Pro Pro Thr Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr 
145 150 155 160 

Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr Thr Pro Pro Thr 
165 170 175 

Thr Pro Val Thr Thr Pro Pro Thr Thr Val Ala Pro Thr Thr Val Ala 

180 185 190 

Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro 
195 200 205 

Ala Thr Ala Thr Pro Thr Thr Val Ala Pro Gin Pro Thr Gin Gin Pro 
210 215 220 

Thr Gin Gin Pro Thr Gin Gin Met Pro Thr Gin Gin Gin Thr Val Ala 

225 230 235 240 

Pro Gin Thr Val Ala Pro Ala Pro Gin Pro Pro Ser Gly Gly Arg Asn 
245 250 255 

Gly Ser Gly Gly Gly Asp Leu Phe Gly Gly Phe 
260 265 

INFORMATION FOR SEQ ID NO : 143 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

lie Asn Gin Pro Leu Ala Pro Pro Ala Pro Pro Asp Pro Pro Ser Pro 
15 10 15 

Pro Arg Pro Pro Val Pro Pro Val Pro Pro Leu Pro Pro Ser Pro Pro 
20 25 30 

Ser Pro Pro Thr Gly Trp Val Pro Arg Ala Leu Leu Pro Pro Trp Leu 
35 40 45 

Ala Gly Thr Pro Pro Ala Pro Pro Val Pro Pro Met Ala Pro Leu Pro 
50 55 60 

Pro Ala Ala Pro Leu Pro Pro Leu Pro Pro Leu Pro Pro Leu Pro Thr 
65 70 75 80 

Ser His Pro Pro Arg Pro Pro Ala Pro Pro Ala Pro Pro Ala Pro Pro 
85 90 95 

Ala Cys Pro Phe Val Pro Val Pro Pro Ala Pro Pro Leu Pro Pro Ser 
100 105 110 

Pro Pro Thr Glu Leu Pro Ala Asp Ala Ala Cys Pro Pro Ala Pro Pro 
115 120 125 

Ala Pro Pro Leu Ala Pro Pro Ser Pro Pro Ala Gly Ser Ala Ala lie 
130 135 140 

Arg Ala Leu Thr Gly Ala Thr Ser Ala Ser Thr Leu Gly His Arg Ala 
145 150 155 160 

Leu Pro Asp Asp Thr Thr Ala Arg Gly Cys Arg Arg Thr Gly 
165 170 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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Gin Pro Pro Ala Glu Val Ser Asp Gin Arg Val Ser Gly Leu Thr Gly 
15 10 15 

Ala Val Gin Pro Ser Pro Arg Thr Thr Ala Glu Asp Pro Arg Pro Arg 

20 25 - 30 

Asn Arg Arg 
35 

INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5: 

Arg Ala Asp Ser Ala Gly Cys Thr Cys Arg Trp Cys Xaa Pro His Glu 
15 10 15 

Cys Arg Arg Pro Ala Met Arg Gin Gin His Gly Ser Arg Ser Thr Thr 
20 25 30 

Pro Pro Gly Pro Arg Gly Arg Ser Ala Arg Val Arg Pro Gly Arg Leu 
35 40 45 

Phe Pro Trp Ala Gly Ser Ser Asp Val Phe Pro Pro Trp Phe Ala Ala 
50 55 60 

lie Met Pro Ala Arg Arg Val Gly Arg Pro Val Trp Pro Xaa Val Asp 
65 70 75 80 

Gin His Thr Arg Asp Thr Gly Leu Cys Lys Leu Phe Glu Arg Arg Ala 
85 90 95 

Gly Gin Leu Arg Arg Gin Phe Tyr 
100 

INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Mycobacterium tuberculosis 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
GGATCCATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR Primer" 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR Primer" 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 

(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " PCR primer" 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GAGAGAATTC TCAGAAGCCC ATTTGCGAGG ACA 

(2) INFORMATION FOR SEQ ID NO: 15 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 



(ix) FEATURE: 



149 



(A) NAME/KEY: CDS 

(B) LOCATION: 152.. 1273 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTG- TCCTCGCCGA 60 

AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120 

GCGGAAATTG AAGAGCACAG AAAGGTATGG C GTG AAA ATT CGT TTG CAT ACG 172 

Val Lys He Arg Leu His Thr 



CTG TTG GCC GTG TTG ACC GCT GCG CCG CTG CTG CTA GCA GCG GCG GGC 
Leu Leu Ala Val Leu Thr Ala Ala Pro Leu Leu Leu Ala Ala Ala Gly 



TGT GGC TCG AAA CCA CCG AGC GGT TCG CCT GAA ACG GGC GCC GGC GCC 
Cys Gly Ser Lys Pro Pro Ser Gly Ser Pro Glu Thr Gly Ala Gly Ala 



GGT ACT GTC GCG ACT ACC CCC GCG TCG TCG CCG GTG ACG TTG GCG GAG 
Gly Thr Val Ala Thr Thr Pro Ala Ser Ser Pro Val Thr Leu Ala Glu 



ACC GGT AGC ACG CTG CTC TAC CCG CTG TTC AAC CTG TGG GGT CCG GCC 
Thr Gly Ser Thr Leu Leu Tyr Pro Leu Phe Asn Leu Trp Gly Pro Ala 



TTT CAC GAG AGG TAT CCG AAC GTC ACG ATC ACC GCT CAG GGC ACC GGT 
Phe His Glu Arg Tyr Pro Asn Val Thr He Thr Ala Gin Gly Thr Gly 



TCT GGT GCC GGG ATC GCG CAG GCC GCC GCC GGG ACG GTC AAC ATT GGG 
Ser Gly Ala Gly He Ala Gin Ala Ala Ala Gly Thr Val Asn He Gly 
90 95 100 

GCC TCC GAC GCC TAT CTG TCG GAA GGT GAT ATG GCC GCG CAC AAG GGG 
Ala Ser Asp Ala Tyr Leu Ser Glu Gly Asp Met Ala Ala His Lys Gly 
105 110 115 

CTG ATG AAC ATC GCG CTA GCC ATC TCC GCT CAG CAG GTC AAC TAC AAC 
Leu Met Asn He Ala Leu Ala He Ser Ala Gin Gin Val Asn Tyr Asn 
120 125 130 135 

CTG CCC GGA GTG AGC GAG CAC CTC AAG CTG AAC GGA AAA GTC CTG GCG 
Leu Pro Gly Val Ser Glu His Leu Lys Leu Asn Gly Lys Val Leu Ala 
140 145 150 

GCC ATG TAC CAG GGC ACC ATC AAA ACC TGG GAC GAC CCG CAG ATC GCT 
Ala Met Tyr Gin Gly Thr He Lys Thr Trp Asp Asp Pro Gin He Ala 
155 160 165 



GCG CTC AAC CCC GGC GTG AAC CTG CCC GGC ACC GCG GTA GTT CCG CTG 
Ala Leu Asn Pro Gly Val Asn Leu Pro Gly Thr Ala Val Val Pro Leu 



700 
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170 175 180 

CAC CGC TCC GAC GGG TCC GGT GAC ACC TTC TTG TTC ACC CAG TAC CTG 748 
His Arg Ser Asp Gly Ser Gly Asp Thr Phe Leu Phe Thr Gin Tyr Leu 
185 190 195 

TCC AAG CAA GAT CCC GAG GGC TGG GGC AAG TCG CCC GGC TTC GGC ACC 796 
Ser Lys Gin Asp Pro Glu Gly Trp Gly Lys Ser Pro Gly Phe Gly Thr 
200 205 210 215 

ACC GTC GAC TTC CCG GCG GTG CCG GGT GCG CTG GGT GAG AAC GGC AAC 844 
Thr Val Asp Phe Pro Ala Val Pro Gly Ala Leu Gly Glu Asn Gly Asn 
220 225 230 

GGC GGC ATG GTG ACC GGT TGC GCC GAG ACA CCG GGC TGC GTG GCC TAT 892 
Gly Gly Met Val Thr Gly Cys Ala Glu Thr Pro Gly Cys Val Ala Tyr 
235 240 245 

ATC GGC ATC AGC TTC CTC GAC CAG GCC AGT CAA CGG GGA CTC GGC GAG 94 0 

lie Gly lie Ser Phe Leu Asp Gin Ala Ser Gin Arg Gly Leu Gly Glu 
250 255 260 

GCC CAA CTA GGC AAT AGC TCT GGC AAT TTC TTG TTG CCC GAC GCG CAA 988 
Ala Gin Leu Gly Asn Ser Ser Gly Asn Phe Leu Leu Pro Asp Ala Gin 
265 270 275 

AGC ATT CAG GCC GCG GCG GCT GGC TTC GCA TCG AAA ACC CCG GCG AAC 103 6 

Ser lie Gin Ala Ala Ala Ala Gly Phe Ala Ser Lys Thr Pro Ala Asn 
280 285 290 295 

CAG GCG ATT TCG ATG ATC GAC GGG CCC GCC CCG GAC GGC TAC CCG ATC 1084 
Gin Ala lie Ser Met lie Asp Gly Pro Ala Pro Asp Gly Tyr Pro lie 
300 305 310 

ATC AAC TAC GAG TAC GCC ATC GTC AAC AAC CGG CAA AAG GAC GCC GCC 1132 
lie Asn Tyr Glu Tyr Ala He Val Asn Asn Arg Gin Lys Asp Ala Ala 
315 320 325 

ACC GCG CAG ACC TTG CAG GCA TTT CTG CAC TGG GCG ATC ACC GAC GGC 1180 
Thr Ala Gin Thr Leu Gin Ala Phe Leu His Trp Ala He Thr Asp Gly 
330 335 340 

AAC AAG GCC TCG TTC CTC GAC CAG GTT CAT TTC CAG CCG CTG CCG CCC 1228 
Asn Lys Ala Ser Phe Leu Asp Gin Val His Phe Gin Pro Leu Pro Pro 
345 350 355 

GCG GTG GTG AAG TTG TCT GAC GCG TTG ATC GCG ACG ATT TCC AGC 1273 

Ala Val Val Lys Leu Ser Asp Ala Leu He Ala Thr He Ser Ser 
360 365 370 

TAGCCTCGTT GACCACCACG CGACAGCAAC CTCCGTCGGG CCATCGGGCT GCTTTGCGGA 1333 
GCATGCTGGC CCGTGC CGGT GAAGTCGGCC GCGCTGGCCC GGCCATCCGG TGGTTGGGTG 1393 



GGATAGGTGC GGTGATCCCG CTGCTTGCGC TGGTCTTGGT GCTGGTGGTG CTGGTCATCG 1453 
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AGGCGATGGG TGCGATCAGG CTCAACGGGT TGCATTTCTT CACCGCCACC GAATGGAATC 1513 

CAGGCAACAC CTACGGCGAA AC CGTTGTCA CCGACGCGTC GCCCATCCGG TCGGCGCCTA 1573 

CTACGGGGCG TTGCCGCTGA TCGTCGGGAC GCTGGCGACC TCGGCAATCG CCCTGATCAT 1633 

CGCGGTGCCG GTCTCTGTAG GAGCGGCGCT GGTGATCGTG GAACGGCTGC CGAAACGGTT 1693 

GGCCGAGGCT GTGGGAATAG TCCTGGAATT GCTCGCCGGA ATCCCCAGCG TGGTCGTCGG 1753 

TTTGTGGGGG GCAATGACGT TCGGGCCGTT CATCGCTCAT CACATCGCTC CGGTGATCGC 1813 

TCACAACGCT CCCGATGTGC CGGTGCTGAA CTACTTGCGC GGCGACCCGG GCAACGGGGA 1873 

GGGCATGTTG GTGT CCGGTC TGGTGTTGGC GGTGATGGTC GTTCCCATTA TCGCCACCAC 1933 

CACTCATGAC CTGTTCCGGC AGGTGCCGGT GTTGCCC CGG GAGGGCGCGA TCGGGAATTC 1993 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Val Lys lie Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro 
15 10 15 

Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser 
20 25 30 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser 
35 40 45 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 

50 55 60 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 



He Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly He Ala Gin Ala Ala 
85 90 95 

Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 
100 105 110 

Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala He Ser 
115 120 125 

Ala Gin Gin Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys 
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Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gin Gly Thr lie Lys Thr 
145 150 155 160 

Trp Asp Asp Pro Gin lie Ala Ala Leu Asn Pro Gly Val Asn Leu Pro 
165 170 175 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
180 185 190 

Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro Glu Gly Trp Gly 
195 200 205 

Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly 
210 215 220 

Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu 
225 230 235 240 

Thr Pro Gly Cys Val Ala Tyr lie Gly lie Ser Phe Leu Asp Gin Ala 
245 250 255 

Ser Gin Arg Gly Leu Gly Glu Ala Gin Leu Gly Asn Ser Ser Gly Asn 
260 265 270 

Phe Leu Leu Pro Asp Ala Gin Ser lie Gin Ala Ala Ala Ala Gly Phe 
275 280 285 

Ala Ser Lys Thr Pro Ala Asn Gin Ala lie Ser Met lie Asp Gly Pro 
290 295 300 

Ala Pro Asp Gly Tyr Pro lie lie Asn Tyr Glu Tyr Ala lie Val Asn 
305 310 315 320 

Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala Phe Leu 
325 330 335 

His Trp Ala He Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gin Val 
340 345 350 

His Phe Gin Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu 
355 360 365 

He Ala Thr He Ser Ser 
370 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1993 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 60 

AGCATG CGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120 

GCGGAAATTG AAGAGCACAG AAAGGTATGG CGTGAAAATT CGTTTGCATA CGCTGTTGGC 180 

CGTGTTGACC GCTGCGCCGC TGCTGCTAGC AGCGGCGGGC TGTGGCTCGA AACCACCGAG 240 

CGGTTCGCCT GAAACGGGCG CCGGCGCCGG TACTGTCGCG ACTACCCCCG CGTCGTCGCC 300 

GGTGACGTTG GCGGAGACCG GTAGCACGCT GCTCTACCCG CTGTTCAACC TGTGGGGTCC 360 

GGCCTTTCAC GAGAGGTATC CGAACGTCAC GATCACCGCT CAGGGCACCG GTTCTGGTGC 420 

CGGGATCGCG CAGGCCGCCG CCGGGACGGT CAACATTGGG GCCTCCGACG CCTATCTGTC 480 

GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT GATGAACATC GCGCTAGCCA TCTCCGCTCA 540 

GCAGGTCAAC TACAACCTGC CCGGAGTGAG CGAGCACCTC AAGCTGAACG GAAAAGTCCT 600 

GGCGGCCATG TACCAGGGCA CCATCAAAAC CTGGGACGAC CCGCAGATCG CTGCGCTCAA 660 

CCCCGGCGTG AACCTGCCCG GCACCGCGGT AGTTCCGCTG CACCGCTCCG ACGGGTCCGG 720 

TGACACCTTC TTGTTCACCC AGTACCTGTC CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC 780 

GCCCGGCTTC GGCACCACCG TCGACTTCCC GGCGGTGCCG GGTGCGCTGG GTGAGAACGG 840 

CAACGGCGGC ATGGTGACCG GTTGCGCCGA GACACCGGGC TGCGTGGCCT ATATCGGCAT 900 

CAGCTTCCTC GACCAGGCCA GTCAACGGGG ACTCGGCGAG GCCCAACTAG GCAATAGCTC 960 

TGGCAATTTC TTGTTGCCCG ACGCGCAAAG CATTCAGGCC GCGGCGGCTG GCTTCGCATC 1020 

GAAAACCCCG GCGAACCAGG CGATTTCGAT GATCGACGGG CCCGCCCCGG ACGGCTACCC 1080 

GATCATCAAC TACGAGTACG CCATCGTCAA CAACCGGCAA AAGGACGCCG CCACCGCGCA 1140 

GACCTTGCAG GCATTTCTGC ACTGGGCGAT CACCGACGGC AACAAGGCCT CGTTCCTCGA 1200 

CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC GGTGGTGAAG TTGTCTGACG CGTTGATCGC 1260 

GACGATTTCC AGCTAGCCTC GTTGACCACC ACGCGACAGC AACCTCCGTC GGGCCATCGG 132 0 

GCTGCTTTGC GGAGCATGCT GGCCCGTGCC GGTGAAGTCG GCCGCGCTGG CCCGGCCATC 1380 

CGGTGGTTGG GTGGGATAGG TGCGGTGATC CCGCTGCTTG CGCTGGTCTT GGTGCTGGTG 1440 

GTGCTGGTCA TCGAGGCGAT GGGTGCGATC AGGCTCAACG GGTTGCATTT CTTCACCGCC 1500 

ACCGAATGGA ATCCAGGCAA CACCTACGGC GAAACCGTTG TCACCGACGC GTCGCCCATC 1560 

CGGTCGGCGC CTACTACGGG GCGTTGCCGC TGATCGTCGG GACGCTGGCG ACCTCGGCAA 1620 
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TCGCCCTGAT CATCGCGGTG CCGGTCTCTG TAGGAGCGGC GCTGGTGATC GTGGAACGGC 1680 

TGCCGAAACG GTTGGCCGAG GCTGTGGGAA TAGTCCTGGA ATTGCTCGCC GGAATCCCCA 1740 

GCGTGGTCGT CGGTTTGTGG GGGGCAATGA CGTTCGGGCC GTTCATCGCT CATCACATCG 1800 

CTCCGGTGAT CGCTCACAAC GCTCCCGATG TGCCGGTGCT GAACTACTTG CGCGGCGACC 18 60 

CGGGCAACGG GGAGGGCATG TTGGTGTCCG GTCTGGTGTT GGCGGTGATG GTCGTTCCCA 1920 

TTATCGCCAC CACCACTCAT GACCTGTTCC GGCAGGTGCC GGTGTTGCCC CGGGAGGGCG 1980 

CGATCGGGAA TTC 1993 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Met Lys He Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro 
15 10 15 

Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser 
20 25 30 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser 
35 40 45 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 
50 55 60 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 
65 70 75 80 

He Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly He Ala Gin Ala Ala 
85 90 95 

Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 
100 105 110 

Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala He Ser 
115 120 125 

Ala Gin Gin Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys 
130 135 140 

Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gin Gly Thr He Lys Thr 
145 150 155 160 
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Trp Asp Asp Pro Gin lie Ala Ala Leu Asn Pro Gly Val Asn Leu Pro 
165 170 175 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
180 185 190 

Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro Glu Gly Trp Gly 
195 200 205 

Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly 
210 215 220 

Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu 
225 230 235 240 

Thr Pro Gly Cys Val Ala Tyr lie Gly lie Ser Phe Leu Asp Gin Ala 
245 250 255 

Ser Gin Arg Gly Leu Gly Glu Ala Gin Leu Gly Asn Ser Ser Gly Asn 
260 265 270 

Phe Leu Leu Pro Asp Ala Gin Ser lie Gin Ala Ala Ala Ala Gly Phe 
275 280 285 

Ala Ser Lys Thr Pro Ala Asn Gin Ala He Ser Met He Asp Gly Pro 
290 295 300 

Ala Pro Asp Gly Tyr Pro He He Asn Tyr Glu Tyr Ala He Val Asn 
305 310 315 320 

Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala Phe Leu 
325 330 335 

His Trp Ala He Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gin Val 
340 345 350 

His Phe Gin Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu 
355 360 365 

He Ala Thr He Ser Ser 
370 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
GGTCTTGACC ACCACCTGGG TGTCGAAGTC GGTGCCCGGA TTGAAGTCCA GGTACTCGTG 60 
GGTGGGGCGG GCGAAACAAT AGCGACAAGC ATGCGAGCAG CCGCGGTAGC CGTTGACGGT 120 
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GTAGCGAAAC GGCAACGCGG CCGCGTTGGG CACCTTGTTC AGCGCTGATT TGCACAACAC 180 

CTCGTGGAAG GTGATGCCGT CGAATTGTGG CGCGCGAACG CTGCGGACCA GGCCGATCCG 240 

CTGCAACCCG GCAGCGCCCG TCGTCAACGG GCATCCCGTT CACCGCGACG GCTTGCCGGG 300 

CCCAACGCAT ACCATTATTC GAACAACCGT TCTATACTTT GTCAACGCTG GCCGCTACCG 360 

AGCGCCGCAC AGGATGTGAT ATGCCATCTC TGCCCGCACA GACAGGAGCC AGGCCTTATG 420 

ACAGCATTCG GCGTCGAGCC CTACGGGCAG CCGAAGTACC TAGAAATCGC CGGGAAGCGC 480 

ATGGCGTATA TCGACGAAGG CAAGGGTGAC GCCATCGTCT TTCAGCACGG CAACCCCACG 540 

TCGTCTTACT TGTGGCGCAA CATCATGCCG CACTTGGAAG GGCTGGGCCG GCTGGTGGCC 600 

TGCGATCTGA TCGGGATGGG CGCGTCGGAC AAGCTCAGCC CATCGGGACC CGACCGCTAT 660 

AGCTATGGCG AGCAACGAGA CTTTTTGTTC GCGCTCTGGG ATGCGCTCGA CCTCGGCGAC 720 

Hp CACGTGGTAC TGGTGCTGCA CGACTGGGGC TCGGCGCTCG GCTTCGACTG GGCTAACCAG 780 

=P CATCGCGACC GAGTGCAGGG GATCGCGTTC ATGGAAGCGA TCGTCACCCC GATGACGTGG 840 

*_ GCGGACTGGC CGCCGGCCGT GCGGGGTGTG TTCCAGGGTT TCCGATCGCC TCAAGGCGAG 900 

flj CCAATGGCGT TGGAGCACAA CATCTTTGTC GAACGGGTGC TGCCCGGGGC GATCCTGCGA 960 

ru 

fj] CAGCTCAGCG ACGAGGAAAT GAACCACTAT CGGCGGCCAT TCGTGAACGG CGGCGAGGAC 1020 

o 

f|j CGTCGCCCCA CGTTGTCGTG GCCACGAAAC CTTCCAATCG ACGGTGAGCC CGCCGAGGTC 1080 

GTCGCGTTGG TCAACGAGTA CCGGAGCTGG CTCGAGGAAA CCGACATGCC GAAACTGTTC 1140 

ATCAACGCCG AGCCCGGCGC GATCATCACC GGCCGCATCC GTGACTATGT CAGGAGCTGG 1200 

CCCAACCAGA CCGAAATCAC AGTGCCCGGC GTGCATTTCG TTCAGGAGGA CAGCGATGGC 1260 

GTCGTATCGT GGGCGGGCGC TCGGCAGCAT CGGCGACCTG GGAGCGCTCT CATTTCACGA 1320 

GACCAAGAAT GTGATTTCCG GCGAAGGCGG CGCCCTGCTT GTCAACTCAT AAGACTTCCT 1380 

GCTCCGGGCA GAGATTCTCA GGGAAAAGGG CACCAATCGC AGCCGCTTCC TTCG CAACGA 1440 

GGTCGACAAA TATACGTGGC AGGACAAAGG TCTTCCTATT TGCCCAGCGA ATTAGTCGCT 1500 

GCCTTTCTAT GGGCTCAGTT CGAGGAAGCC GAGCGGATCA CGCGTATCCG ATTGGACCTA 1560 

TGGAACCGGT ATCATGAAAG CTTCGAATCA TTGGAACAGC GGGGGCTCCT GCGCCGTCCG 1620 

ATCATCCCAC AGGGCTGCTC TCACAACGCC CACATGTACT ACGTGTTACT AGCGCCCAGC 1680 

GCCGATCGGG AGGAGGTGCT GGCGCGTCTG ACGAGCGAAG GTATAGGCGC GGTCTTTCAT 1740 
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TACGTGCCGC TTCACGATTC GCCGGCCGGG CGTCGCT 1777 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GAGATTGAAT CGTACCGGTC TCCTTAGCGG CTCCGTCCCG TGAATGCCCA TATCACGCAC 60 

GGCCATGTTC TGGCTGTCGA CCTTCGCCCC ATGCCCGGAC GTTGGTAAAC CCAGGGTTTG 120 

ATCAGTAATT CCGGGGGACG GTTGCGGGAA GGCGGCCAGG ATGTGCGTGA GCCGCGGCGC 180 

CGCCGTCGCC CAGGCGACCG CTGGATGCTC AGCCCCGGTG CGGCGACGTA GCCAGCGTTT 240 

GGCGCGTGTC GTCCACAGTG GTACTCCGGT GACGACGCGG CGCGGTGCCT GGGTGAAGAC 3 00 

CGTGAC CGAC GCCGCCGATT CAGA 324 
(2) INFORMATION FOR SEQ ID NO: 158: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

GCGGTACCGC CGCGTTGCGC TGGCACGGGA CCTGTACGAC CTGAACCACT TCGCCTCGCG 60 

AACGATTGAC GAACCGCTCG TGCGGCGGCT GTGGGTGCTC AAGGTGTGGG GTGATGTCGT 120 

CGATGACCGG CGCGGCACCC GGCCACTACG CGTCGAAGAC GTCCTCGCCG CCCGCAGCGA 180 

GCACGACTTC CAGCCCGACT CGATCGGCGT GCTGACCCGT CCTGTCGCTA TGGCTGCCTG 240 

GGAAGCTCGC GTTCGGAAGC GATTTGCGTT CCTCACTGAC CTCGACGCCG ACGAGCAGCG 300 

GTGGGCCGCC TGCGACGAAC GGCACCGCCG CGAAGTGGAG AACGCGCTGG CGGTGCTGCG 3 60 

GTCCTGATCA ACCTGCCGGC GATCGTGCCG TTCCGCTGGC ACGGTTGCGG CTGGACGCGG 42 0 

CTGAATCGAC TAGATGAGAG CAGTTGGGCA CGAATCCGGC TGTGGTGGTG AGCAAGACAC 480 

GAGTACTGTC ATCACTATTG GATGCACTGG ATGACCGGCC TGATTCAGCA GG AC CAATGG 540 

AACTGCCCGG GGCAAAACGT CTCGGAGATG ATCGGCGTCC CCTCGGAACC CTGCGGTGCT 600 

GGCGTCATTC GGACATCGGT CCGGCTCGCG GGATCGTGGT GACGCCAGCG CTGAAGGAGT 660 
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GGAGCGCGGC GGTGCACGCG CTGCTGGACG GCCGGCAGAC GGTGCTGCTG CGTAAGGGCG 720 

GGATCGGCGA GAAGCGCTTC GAGGTGGCGG CCCACGAGTT CTTGTTGTTC CCGACGGTCG 780 

CGCACAGCCA CGCCGAGCGG GTTCGCCCCG AGCACCGCGA CCTGCTGGGC CCGGCGGCCG 84 0 

CCGACAGCAC CGACGAGTGT GTGCTACTGC GGGCCGCAGC GAAAGTTGTT GCCGCACTGC 900 

CGGTTAACCG GCCAGAGGGT CTGGACGCCA TCGAGGATCT GCACATCTGG ACCGCCGAGT 960 

CGGTGCGCGC CGACCGGCTC GACTTTCGGC CCAAGCACAA ACTGGCCGTC TTGGTGGTCT 1020 

CGGCGATCCC GCTGGCCGAG CCGGTCCGGC TGGCGCGTAG GCCCGAGTAC GGCGGTTGCA 1080 

CCAGCTGGGT GCAGCTGCCG GTGACGCCGA CGTTGGCGGC GCCGGTGCAC GACGAGGCCG 1140 

CGCTGGCCGA GGTCGCCGCC CGGGTCCGCG AGGCCGTGGG TTGACTGGGC GGCATCGCTT 1200 

GGGTCTGAGC TGTACGCCCA GTCGGCGCTG CGAGTGATCT GCTGTCGGTT CGGTCCCTGC 1260 

TGGCGTCAAT TGACGGCGCG GGCAACAGCA GCATTGGCGG CGCCATCCTC CGCGCGGCCG 132 0 

GCGCCCACCG CTACAACC 1338 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CCGGCGGCAC CGGCGGCACC GGCGGTACCG GCGGCAACGG CGCTGACGCC GCTGCTGTGG 60 

TGGGCTTCGG CGCGAACGGC GACCCTGGCT TCGCTGGCGG CAAAGGCGGT AACGGCGGAA 120 

TAGGTGGGGC CGCGGTGACA GGCGGGGTCG CCGGCGACGG CGGCACCGGC GGCAAAGGTG 180 

GCACCGGCGG TGCCGGCGGC GCCGGCAACG ACGCCGGCAG CACCGGCAAT CCCGGCGGTA 240 

AGGGCGGCGA CGGCGGGATC GGCGGTGCCG GCGGGGCCGG CGGCGCGGCC GGCACCGGCA 300 

ACGGCGGCCA TGCCGGCAAC C 32 1 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GAAGACCCGG CCCCGCCATA TCGATCGGCT CGCCGACTAC TTTCGCCGAA CGTGCACGCG 60 

GCGGCGTCGG GCTGATCATC ACCGGTGGCT ACGCGCCCAA CCGCACCGGA TGGCTGCTGC 120 

CGTTCGCCTC CGAACTCGTC ACTTCGGCGC AAGCCCGACG GCACCGCCGA ATCACCAGGG 180 

CGGTC CACGA TTCGGGTGCA AAGATC CTGC TGCAAATCCT GCACGCCGGA CGCTACGCCT 240 

ACCACCCACT TGCGGTCAGC GCCTCGCCGA TCAAGGCGCC GATCACCCCG TTTCGTCCGC 300 

GAGCACTATC GGCTCGCGGG GTCGAAGCGA CCATCGCGGA TTTCGCCCGC TGCGCGCAGT 360 

TGGCCCGCGA TGCCGGCTAC GACGGCGTCG AAATCATGGG CAGCGAAGGG TATCTGCTCA 420 

ATCAGTTCCT GGCGCCGCGC ACCAACAAGC GCACCGACTC GTGGGGCGGC ACACCGGCCA 480 

ACCGTCGCCG GT 492 
(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:161: 

Phe Ala Gin His Leu Val Glu Gly Asp Ala Val Glu Leu Trp Arg Ala 
15 10 15 

Asn Ala Ala Asp Gin Ala Asp Pro Leu Gin Pro Gly Ser Ala Arg Arg 
20 25 30 

Gin Arg Ala Ser Arg Ser Pro Arg Arg Leu Ala Gly Pro Asn Ala Tyr 
35 40 45 

His Tyr Ser Asn Asn Arg Ser lie Leu Cys Gin Arg Trp Pro Leu Pro 
50 55 60 

Ser Ala Ala Gin Asp Val He Cys His Leu Cys Pro His Arg Gin Glu 
65 70 75 80 

Pro Gly Leu Met Thr Ala Phe Gly Val Glu Pro Tyr Gly Gin Pro Lys 
85 90 95 

Tyr Leu Glu He Ala Gly Lys Arg Met Ala Tyr He Asp Glu Gly Lys 
100 105 110 

Gly Asp Ala He Val Phe Gin His Gly Asn Pro Thr Ser Ser Tyr Leu 
115 120 125 

Trp Arg Asn He Met Pro His Leu Glu Gly Leu Gly Arg Leu Val Ala 



160 



Cys Asp Leu lie Gly Met Gly Ala Ser Asp Lys Leu Ser Pro Ser Gly 
145 150 155 160 

Pro Asp Arg Tyr Ser Tyr Gly Glu Gin Arg Asp Phe Leu Phe Ala Leu 
165 170 175 

Trp Asp Ala Leu Asp Leu Gly Asp His Val Val Leu Val Leu His Asp 
180 185 190 

Trp Gly Ser Ala Leu Gly Phe Asp Trp Ala Asn Gin His Arg Asp Arg 
195 200 205 

Val Gin Gly He Ala Phe Met Glu Ala He Val Thr Pro Met Thr Trp 
210 215 220 

Ala Asp Trp Pro Pro Ala Val Arg Gly Val Phe Gin Gly Phe Arg Ser 
225 230 235 240 

Pro Gin Gly Glu Pro Met Ala Leu Glu His Asn He Phe Val Glu Arg 
245 250 255 

Val Leu Pro Gly Ala He Leu Arg Gin Leu Ser Asp Glu Glu Met Asn 
260 265 270 

His Tyr Arg Arg Pro Phe Val Asn Gly Gly Glu Asp Arg Arg Pro Thr 
275 280 285 

Leu Ser Trp Pro Arg Asn Leu Pro He Asp Gly Glu Pro Ala Glu Val 
290 295 300 

Val Ala Leu Val Asn Glu Tyr Arg Ser Trp Leu Glu Glu Thr Asp Met 
305 310 315 320 

Pro Lys Leu Phe He Asn Ala Glu Pro Gly Ala He He Thr Gly Arg 
325 330 335 

He Arg Asp Tyr Val Arg Ser Trp Pro Asn Gin Thr Glu He Thr Val 
340 345 350 

Pro Gly Val His Phe Val Gin Glu Asp Ser Asp Gly Val Val Ser Trp 
355 360 365 

Ala Gly Ala Arg Gin His Arg Arg Pro Gly Ser Ala Leu He Ser Arg 
370 375 380 

Asp Gin Glu Cys Asp Phe Arg Arg Arg Arg Arg Pro Ala Cys Gin Leu 

385 390 395 400 

He Arg Leu Pro Ala Pro Gly Arg Asp Ser Gin Gly Lys Gly His Gin 
405 410 415 



Ser Gin Pro Leu Pro Ser Gin Arg Gly Arg Gin He Tyr Val Ala Gly 
420 425 430 
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Gin Arg Ser Ser Tyr Leu Pro Ser Glu Leu Val Ala Ala Phe Leu Trp 
435 440 445 

Ala Gin Phe Glu Glu Ala Glu Arg lie Thr Arg lie Arg Leu Asp Leu 
450 455 460 " 

Trp Asn Arg Tyr His Glu Ser Phe Glu Ser Leu Glu Gin Arg Gly Leu 
465 470 475 480 

Leu Arg Arg Pro lie lie Pro Gin Gly Cys Ser His Asn Ala His Met 
485 490 495 

Tyr Tyr Val Leu Leu Ala Pro Ser Ala Asp Arg Glu Glu Val Leu Ala 
500 . 505 510 

Arg Leu Thr Ser Glu Gly lie Gly Ala Val Phe His Tyr Val Pro Leu 
515 520 525 

His Asp Ser Pro Ala Gly Arg Arg 
530 535 

(2) INFORMATION FOR SEQ ID NO : 162 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Asn Glu Ser Ala Pro Arg Ser Pro Met Leu Pro Ser Ala Arg Pro Arg 
15 10 15 

Tyr Asp Ala lie Ala Val Leu Leu Asn Glu Met His Ala Gly His Cys 
20 25 30 

Asp Phe Gly Leu Val Gly Pro Ala Pro Asp He Val Thr Asp Ala Ala 
35 40 45 

Gly Asp Asp Arg Ala Gly Leu Gly Val Asp Glu Gin Phe Arg His Val 
50 55 60 

Gly Phe Leu Glu Pro Ala Pro Val Leu Val Asp Gin Arg Asp Asp Leu 
65 70 75 80 

Gly Gly Leu Thr Val Asp Trp Lys Val Ser Trp Pro Arg Gin Arg Gly 
85 90 95 

Ala Thr Val Leu Ala Ala Val His Glu Trp Pro Pro He Val Val His 
100 105 110 



Phe Leu Val Ala Glu Leu Ser Gin Asp Arg Pro Gly Gin His Pro Phe 
115 120 125 
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Asp Lys Asp Val Val Leu Gin Arg His Trp Leu Ala Leu Arg Arg Ser 
130 135 140 

Glu Thr Leu Glu His Thr Pro His Gly Arg Arg Pro Val Arg Pro Arg 
145 150 155 - 160 

His Arg Gly Asp Asp Arg Phe His Glu Arg Asp Pro Leu His Ser Val 
165 170 175 

Ala Met Leu Val Ser Pro Val Glu Ala Glu Arg Arg Ala Pro Val Val 
180 185 190 

Gin His Gin Tyr His Val Val Ala Glu Val Glu Arg He Pro Glu Arg 
195 200 205 

Glu Gin Lys Val Ser Leu Leu Ala He Ala He Ala Val Gly Ser Arg 
210 215 220 

Trp Ala Glu Leu Val Arg Arg Ala His Pro Asp Gin He Ala Gly His 
225 230 235 240 

Gin Pro Ala Gin Pro Phe Gin Val Arg His Asp Val Ala Pro Gin Val 
245 250 255 

Arg Arg Arg Gly Val Ala Val Leu Lys Asp Asp Gly Val Thr Leu Ala 
260 265 270 

Phe Val Asp He Arg His Ala Leu Pro Gly Asp Phe 
275 280 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

ATGAACATGT CGTCGGTGGT GGGTCGCAAG GCCTTTGCGC GATTCGCCGG CTACTCCTCC 60 

GCCATGCACG CGATCGCCGG TTTCTCCGAT GCGTTGCGCC AAGAGCTGCG GGGTAGCGGA 120 

ATCGCCGTCT CGGTGATCCA CCCGGCGCTG ACCCAGACAC CGCTGTTGGC CAACGTCGAC 180 

CCCGCCGACA TGCCGCCGCC GTTTCGCAGC CTCACGCCCA TTCCCGTTCA CTGGGTCGCG 240 

GCAGCGGTGC TTGACGGTGT GGCG 264 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

TAGTCGGCGA CGATGACGTC GCGGTCCAGG CCGACCGCTT GAAGCAC C AG CGCGACCACG 60 

AAGC CGGTGC GATCCTTACC CGCGAAGCAG TGGGTGAGCA CCGGGCGTCC GGCGGCAAGC 120 

AGTGTGACGA CACGATGTAG CGCGC GCTGT GCTCCATTGC GCGTTGGGAA TTGGCGATAC 180 

TCGTCGGTCA TGTAGCGGGT GGCCGCGTCA TTTAT CGACT GGCTGGATTC GCCGGACTCG 24 0 

CCGTTGGACC CGTCATTGGT TAGCAGCCTC TTGAATGCGG TTTCGTGCGG CGCTGAGTCG 300 

T CGG CGT CAT CATCGGCGAG GTCGGGGAAC GGCAGCAGGT GGACGTCGAT GCCGTCCGGA 360 

ACCCGTCCTG GACCGCGGCG GGCAACCTCC CGGGACGACC GCAGGTCGGC AACGTCGGTG 420 

ATCCCCAGCC GGCGCAGCGT TGCCCCTCGT GCCGAATTCG GCACGAGGCT GGCGAGCCAC 480 

CGGGCAT C AC CAAGCAACGC TTGCCCAGTA CGGATCGTCA CTTCCGCATC CGGCAGACCA 540 

ATCTCCTCGC CGCCCATCGT CAGATCCCGC TCGTGCGTTG ACAAGAACGG CCGCAGATGT 6 00 

GCCAGCGGGT AT CGGAGATT GAACCGCGCA CGCAGTTCTT CAATCGCTGC GCGCTGCCGC 660 

ACTATTGGCA CTTTCCGGCG GTC GCGGTAT TCAGCAAGCA TGCGAGTCTC GACGAACTCG 720 

CCCCACGTAA CCCACGGCGT AGCTCCCGGC GTGACGCGGA GGATCGGCGG GTGATCTTTG 780 

CCGCCACGCT CGTAGCCGTT GATCCACCGC TTCGCGGTGC CGGCGGGGAG GCCGATCAGC 840 

TTATCGACCT CGGCGTATGC CGACGGCAAG CTGGGCGCGT TCGTCGAGGT CAAGAACTCC 900 

AC CAT CGG CA CCGGCACCAA GGTGCCGCAC CTGACCTACG TCGGCGACGC CGACATCGGC 960 

GAGTACAGCA ACATCGGCGC CTCCAGCGTG TTCGTCAACT ACGACGGTAC GTCCAAACGG 1020 

CGCACCACCG TCGGTTCGCA CGTACGGACC GGGTC CGACA CCATGTTCGT GGCCCCAGTA 1080 

ACCATCGGCG ACGGCGCGTA TACCGGGGCC GGCACAGTGG TGCGGGAGGA TGTCCCGCCG 114 0 

GGGGCGCTGG CAGTGTCGGC GGGTCCGCAA C 1171 
(2) INFORMATION FOR SEQ ID NO: 16 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 



164 



GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 



60 



ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 



120 



TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 



180 



GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCC 
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(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 04 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

CCTCGCCACC ATGGGCGGGC AGGGCGGTAG CGGTGGCGCC GGCTCTACCC CAGGCGCCAA 60 

GGGCGCCCAC GGCTTCACTC CAACCAGCGG CGGCGACGGC GGCGACGGCG GCAACGGCGG 120 

CAACTCCCAA GTGGTCGGCG GCAACGGCGG CGACGGCGGC AATGGCGGCA ACGGCGGCAG 180 

CGCCGGCACG GGCGGCAACG GCGGCCGCGG CGGCGACGGC GCGTTTGGTG GCATGAGTGC 240 

CAACGCCACC AACC CTGGTG AAAACGGGCC AAACGGTAAC CCCGGCGGCA ACGGTGGCGC 3 00 

CGGC 304 
(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

GTGGGACGCT GCCGAGGCTG TATAACAAGG ACAACATCGA CCAGCGCCGG CTCGGTGAGC 60 

TGATCGACCT ATTTAACAGT GCGCGCTTCA GCCGGCAGGG CGAGCACCGC GCCCGGGATC 12 0 

TGATGGGTGA GGTCTACGAA TACTTCCTCG GCAATTTCGC TCGCGCGGAA GGGAAGCGGG 18 0 

GTGGCGAGTT CTTTACCCCG CCCAGCGTGG TCAAGGTGAT CGTGGAGGTG CTGGAGCCGT 240 

CGAGTGGGCG GGTGTATGAC CCGTGCTGCG GTTCCGGAGG CATGTTTGTG CAGACCGAGA 300 

AGTTCATCTA CGAACACGAC GGCGATCCGA AGGATGTCTC GATCTATGGC CAGGAAAGCA 360 

TTGAGGAGAC CTGGCGGATG GCGAAGATGA ACCTCGCCAT CCACGGCATC GACAACAAGG 420 
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GGCTCGGCGC CCGATGGAGT GATACCTTCG CCCGCGACCA GCACCCGGAC GTGCAGATGG 480 

ACTACGTGAT GGCCAATCCG CCGTTCAACA TCAAAGACTG GGCCCGCAAC GAGGAAGACC 540 

CACGCTGGCG CTTCGGTGTT CCGCCCGCCA ATAACGCCAA CTACGCATGG ATTCAGCACA 600 

TCCTGTACAA CTTGGCGCCG GGAGGTCGGG CGGGCGTGGT GATGGCCAAC GGGTCGATGT 660 

CGTCGAACTC CAACGGCAAG GGGGATATTC GCGCGCAAAT CGTGGAGGCG GATTTGGTTT 720 

CCTGCATGGT CGCGTTACCC ACCCAGCTGT TCCGCAGCAC CGGAATCCCG GTGTGCCTGT 780 

GGTTTTTCGC CAAAAACAAG GCGGCAGGTA AGCAAGGGTC TATCAACCGG TGCGGGCAGG 840 

TGCTqgJCAT CGACGCTCGT GAACTGGGCG ACCTAGTGGA CCGGGCCGAG CGGGCGCTGA 900 

CCAACGAGGA GATCGTCCGC ATCGGGGATA CCTTCCACGC GAGCACGACC ACCGGCAACG 960 

CCGGCTCCGG TGGTGCCGGC GGTAATGGGG GCACTGGCCT CAACGGCGCG GGCGGTGCTG 102 0 

GCGGGGCCGG CGGCAACGCG GGTGTCGCCG GCGTGTCCTT CGGCAACGCT GTGGGCGGCG 1080 

ACGGCGGCAA CGGCGGCAAC GGCGGCCACG GCGGCGACGG CACGACGGGC GGCGCCGGCG 1140 

GCAAGGGCGG CAACGGCAGC AGCGGTGCCG CCAGCGGCTC AGGCGTCGTC AACGTCACCG 1200 

CCGGCCACGG CGGCAACGGC GGCAATGGCG GCAACGGCGG CAACGGCTCC GCGGGCGCCG 1260 

GCGGCCAGGG CGGTGC CGGC GGCAGCGCCG GCAACGGCGG CCACGGCGGC GGTGCCACCG 132 0 

GCGGCGCCAG CGGCAAGGGC GGCAACGGCA CCAGCGGTGC CGCCAGCGGC TCAGGCGTCA 1380 

TCAACGTCAC CGCCGGCCAC GGCGGCAACG GCGGCAATGG CCGCAACGGC GGCAACGGC 143 9 
(2) INFORMATION FOR SEQ ID NO -.16 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:168: 

GGGCCGGCGG GGCCGGATTT TCTCGTGCCT TGATTGTCGC TGGGGATAAC GGCGGTGATG 60 

GTGGTAACGG CGGGATGGGC GGGGCTGGCG GGGCTGGCGG CCCCGGCGGG GCCGGCGGCC 120 

TGATCAGCCT GCTGGGCGGC CAAGGCGCCG GCGGGGCCGG CGGGACCGGC GGGGCCGGCG 180 

GTGTTGGCGG TGACGGCGGG GCCGGCGGCC CCGGCAACCA GGCCTTCAAC GCAGGTGCCG 240 

GCGGGGCCGG CGGCCTGATC AGCCTGCTGG GCGGC CAAGG CGCCGGCGGG GCCGGCGGGA 3 00 

CCGGCGGGGC CGGCGGTGTT GGCGGTGAC 32 9 
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(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 
GCAACGGTGG CAACGGCGGC ACCAGCACGA CCGTGGGGAT GGCCGGAGGT AACTGTGGTG 60 
CCGCCGGGCT GATCGGCAAC 80 
(2) INFORMATION FOR SEQ ID NO: 170: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

J5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:170: 

s GGGCTGTGTC GCACTCACAC CGCCGCATTC GGCGACGTTG GCCGCCCAAT ATCCAGCTCA 60 

=1 1 AGGCCTACTA CTTACCGTCG GAGGACCGCC GCATCAAGGT GCGGGTCAGC GCCCAAGGAA 120 

W 

m TCAAGGTCAT CGACCGCGAC GGGCATCGAG GCCGTCGTCG CGCGGCTCGG GCAGGATCCG 180 

Q 

f|j CCCCGGCGCA CTTCGCGCGC CAAGCGGGCT CATCGCTCCG AACGGCGGCG ATCCTGTGAG 240 

CACAACTGAT GGCGCGCAAC GAGATT CGTC CAATTGTCAA GCCGTGTTCG ACCGCAGGGA 3 00 

CCGGTTATAC GTATGTCAAC CTATGTCACT CGCAAGAACC GGCATAACGA TCCCGTGATC 360 

CGCCGACAGC CCACGAGTGC AAGACCGTTA CA 3 92 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ACCGGCGCCA CCGGCGGCAC CGGGTTCGCC GGTGGCGCCG GCGGGGCCGG CGGGCAGGGC 60 

GGTATCAGCG GTGCCGGCGG CACCAACGGC TCTGGTGGCG CTGGCGGCAC CGGCGGACAA 120 

GGCGGCGCCG GGGGCGCTGG CGGGGCCGGC GCCGATAACC CCACCGGCAT CGGCGGCGCC 180 
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GGCGGCACCG GCGGCACCGG CGGAGCGGCC GGAGCCGGCG GGGCCGGTGG CGCCATCGGT 240 

ACCGGCGGCA CCGGCGGCGC GGTGGGCAGC GTCGGTAACG CCGGGATCGG CGGTACCGGC 3 00 

GGTACGGGTG GTGTCGGTGG TGCTGGTGGT GCAGGTGCGG CTGCGGCCGC TGGCAGCAGC 36 0 

GCTACCGGTG GCGCCGGGTT CGCCGGCGGC GCCGGCGGAG AAGGCGGACC GGGCGGCAAC 420 

AGCGGTGTGG GCGGCACCAA CGGCTCCGGC GGCGCCGGCG GTGCAGGCGG CAAGGGCGGC 480 

ACCGGAGGTG CCGGCGGGTC CGGCGCGGAC AACCCCACCG GTGCTGGTTT CGCCG 53 5 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

CCGACGTCGC CGGGGCGATA CGGGGGTCAC CGACTACTAC ATCATCCGCA CCGAGAATCG 60 

GCCGCTGCTG CAACCGCTGC GGGC GGTGCC GGTCATCGGA GATCCGCTGG CCGACCTGAT 12 0 

CCAGCCGAAC CTGAAGGTGA TCGTCAACCT GGGCTACGGC GACCCGAACT ACGGCTACTC 18 0 

GACGAGCTAC GCCGATGTGC GAACGCCGTT CGGGCTGTGG CCGAACGTGC CGCCTCAGGT 24 0 

CATCGCCGAT GCCCTGGCCG CCGGAACACA AGAAGGCATC CTTGACTTCA CGGCCGACCT 300 

GCAGGCGCTG TCCGCGCAAC CGCTCACGCT CCCGCAGATC CAGCTGCCGC AACCCGCCGA 360 

TCTGGTGGCC GCGGTGGCCG CCGCACCGAC GCCGGCCGAG GTGGTGAACA CGCTCGCCAG 420 

GATCATCTCA ACCAACTACG CCGTCCTGCT GCCCACCGTG GACATCGCCC TCGCCTGGTC 480 

ACCACCCTGC CGCTGTACAC CACCCAACTG TTCGTCAGGC AACTCGCTGC GGGCAATCTG 54 0 

ATCAACGCGA TCGGCTATCC CCTGGCGGCC AC CGTAGGTT TAGGCACGAT CGATAGCGGG 600 

CGGCGTGGAA TTGCTCACCC TCCTCGCGGC GGCCTCGGAC ACCGTTCGAA ACATCGAGGG 660 

CCTCGTCACC TAACGGATTC CCGACGGCAT 6 90 
(2) INFORMATION FOR SEQ ID NO: 17 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
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ACGGTGACGG CGGTACTGGC GGCGGCCACG GCGGCAACGG CGGGAATCCC GGGTGGCTCT 60 

TGGGCACAGC CGGGGGTGGC GGCAACGGTG GCGCCGGCAG CACCGGTACT GCAGGTGGCG 12 0 

GCTCTGGGGG CACCGGCGGC GACGGCGGGA CCGGCGGGCG TGGCGGCCTG TTAATGGGCG 18 0 

CCGGCGCCGG CGGGCACGGT GGCACTGGCG GCGCGGGCGG TGCCGGTGTC GACGGTGGCG 240 

GCGCCGGCGG GGCCGGCGGG GCCGGCGGCA ACGGCGG CGC CGGGGGTCAA GCCGCCCTGC 300 

TGTTCGGGCG CGGCGGCACC GGCGGAGCCG GCGGCTACGG CGGCGATGGC GGTGGCGGCG 360 

GTGACGGCTT CGACGGCACG ATGGCCGGCC TGGGTGGTAC CGGTGGC 407 
(2) INFORMATION FOR SEQ ID NO: 174: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
GATCGGTCAG CGCATCGCCC TCGGCGGCAA GCGATTCCGC GGTCTCACCG AAGAACATCG 60 

TGCACGCGGC GGCGCGGACC AGCCCGCTGC GCTGCGGCGC GTCGAACGCC TCCAGCAGGC 120 

ACAGCCAGTC CTTGGCGGCC TGCGAGGCGA ACACGTCGGT GTCACCGGTG TAGATCGCCG 180 

GGATGCC CGC CTCCGCCAAC GCATTCCGGC ACGCCCGCGC GTCTTTGTGA TGCTCGACGA 240 

TCACCGCGAT GTCTGCGGCC ACCACGGGCC GCCCGGCGAA GGTGGCCCCG CTGGCCAGTA 3 00 

GCGCCGCGAC GTCGGCGGCC AGGTCGTCGG GGATGTGCCG GCGCAGCGCT CCGGCGCGAC 360 

GCCCGAAAAA CGACCCCTCA CCCAGCTGGG TCCCGCTGGC ATATCCCTTG CCGTCCTGGG 420 

CGATATTGGA CGCGCATGCC CCGACCGCGT ACAGGCCGGC CACCACCG 468 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 



GGTGGTAACG GCGGCCAGGG TGGCATCGGC GGCGCCGGCG AGAGAGGCGC CGACGGCGCC 60 
GGCCCCAATG CTAACGGCGC AAACGGCGAG AACGGCGGTA GCGGTGGTAA CGGTGGCGAC 120 
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GGCGGCGCCG GCGGCAATGG CGGCGCGGGC GGCAACGCGC AGGCGGCCGG GTACACCGAC 180 
GGCGCCACGG GCACCGGCGG CGACGGCGGC AACGGCGGC 219 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

TAGCTCCGGC GAGGGCGGCA AGGGCGGCGA CGGTGGCCAC GGCGGTGACG GCGTCGGCGG 60 

CAACAGTTCC GTCACCCAAG GCGGCAGCGG CGGTGGCGGC GGCGC CGGCG GCGCCGGCGG 12 0 

CAGCGGCTTT TTCGGCGGCA AGGGCGGCTT CGGCGGCGAC GGCGGTCAGG GCGGCCCCAA 180 

CGGCGGCGGT ACCGTCGGCA CCGTGGCCGG TGGCGGCGGC AACGGCGGTG TCGGCGGCCG 240 

GGGCGGCGAC GGCGTCTTTG CCGGTGCCGG CGGCCAGGGC GGCCTCGGTG GGCAGGGCGG 3 00 

CAATGGCGGC GGCTCCACCG GCGGCAACGG CGGCCTTGGC GGCGCGGGCG GTGGCGGAGG 3 60 

CAACGCCCCG GCTCGTGCCG AATCCGGGCT G AC CATGGAC AGCGCGGCCA AGTTCGCTGC 42 0 

CATCGCATCA GGCGCGTACT GCCCCGAACA CCTGGAACAT CACC CGAGTT AGCGGGGCGC 480 

ATTTCCTGAT CACC 494 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60 

TTGGCGGCAC CGGCGGC C AG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120 

CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGCCGGC GGCGTCGGCG 18 0 

GCCAGAGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC 220 
(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

ATGGCGGCAA CGGGGGCCCC GGCGGTGCTG GCGGGGC CGG CGACTACAAT TTCCAACGGC 60 

GGGCAGGGTG GTGCCGGCGG CCAAGGCGGC CAAGGCGGCC TGGGCGGGGC AAGCACCACC 12 0 

TGATCGGCCT AGCCGCACCC GGGAAAGCCG ATCCAACAGG CGACGATGCC GCCTTCCTTG 180 

CCGCGTTGGA CCAGGCCGGC ATCACCTACG CTGACCCAGG CCACGCCATA ACGGCCGCCA 24 0 

AGGCGATGTG TGGGCTGTGT GCTAACGGCG TAACAGGTCT ACAGCTGGTC GCGGACCTGC 30 0 

GGGACTACAA TCCCGGGCTG ACCATGGACA GCGCGGCCAA GTTCGCTGCC ATCGCATCAG 360 

GCGCGTACTG CCCCGAACAC CTGGAACA 388 
(2) INFORMATION FOR SEQ ID NO: 17 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 60 

ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 120 

TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 180 

GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCCGGC ACCACAGGCG 240 

GCGACGGCGG GGCCGGCGGG GCCGGCGGAA CCGGCGGAAC CGGCGGAGCC GCCGGCACCG 300 

GCACCGGCGG CCAACAAGGC AACGGCGGCA ACGGCGGCAC CGGCGGCAAA GGCGGCACCG 360 

GCGGCGACGG TGCACTCTCA GGCAGCACCG GTGGTGCCGG 40 0 
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
GGCAACGGCG GCAACGGCGG CATCGCCGGC ATTGGGCGGC AACGGCGTTC CGGGACGGGC 60 
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AGCGGCAACG GCGGC CAACG GCGGCAGCGG CGGCAACGGC GGCAACGCCG GCATGGGCGG 120 

CAACAGCGGC ACCGGCAGCG GCGACGGCGG TGCCGGCGGG AACGGCGGCG CGGCGGGCAC 18 0 

GGGCGGCACC GGCGGCGACG GCGGCCTCAC CGGTACTGGC GGCACCGGCG GCAGCGGTGG 240 

CACCGGCGGT GACGGCGGTA ACGGCGGCAA CGGAGCAGAT AACACCGCAA ACATGACTGC 300 

GCAGGCGGGC GGTGACGGTG GCAACGGCGG CGACGGTGGC TTCGGCGGCG GGGCCGGGGC 360 

CGGCGGCGGT GGCTTGACCG CTGGCGC CAA CGGCACCGGC GGGCAAGGCG GCGCCGGCGG 420 

CGATGGCGGC AACGGGGCCA TCGGCGGCCA CGGCCCACTC ACTGACGACC CCGGCGGCAA 480 

CGGGGGCACC GGCGGCAACG GCGGCACCGG CGGCACCGGC GGCGCGGGCA TCGGCAGC 538 
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60 

TTGGCGGCAC CGGCGGCCAG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120 

CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGC CGGC GGCGTCGGCG 180 

GCCACGGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC CGGTGGTGCC GGCGGCACC 23 9 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

AGCAGCGCTA CCGGTGGCGC CGGGTTCGCC GGCGGCGCCG GCGGAGAAGG CGGAGCGGGC 60 

GGCAACAGCG GTGTGGGCGG CACCAACGGC TCCGGCGGCG CCGGCGGTGC AGGCGGCAAG 120 

GGCGGCACCG GAGGTGCCGG CGGGTCCGGC GCGGACAACC CCACCGGTGC TGGTTTCGCC 180 

GGTGGCGCCG GCGGCACAGG TGGCGCGGCC GGCGCCGGCG GGGCCGGCGG GGCGACCGGT 240 

ACCGGCGGCA CCGGCGGCGT TGTCGGCGCC ACCGGTAGTG CAGGCATCGG CGGGGCCGGC 3 00 

GGCCGCGGCG GTGACGGCGG CGATGGGGCC AGCGGTCTCG GCCTGGGCCT CTCCGGCTTT 3 60 
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GACGGCGGCC AAGGCGGCCA AGGCGGGGCC GGCGGCAGCG CCGGCGCCGG CGGCATCAAC 420 

GGGGCCGGCG GGGCCGGCGG CAACGGCGGC GACGGCGGGG ACGGCGCAAC CGGTGCCGCA 480 

GGTCTCGGCG ACAACGGCGG GGTCGGCGGT GACGGTGGGG CCGGTGGCGC CGCCGGCAAC 540 

GGCGGCAACG CGGGCGTCGG CCTGACAGCC AAGGCCGGCG ACGGCGGCGC CGCGGGCAAT 60 0 

GGCGGCAACG GGGGCGCCGG CGGTGCTGGC GGGGCCGGCG ACAACAATTT CAACGGCGGC 660 

CAGGGTGGTG CCGGCGGCCA AGGCGGCCAA GGCGGCTTGG GCGGGGCAAG CACCACCTGA 720 

TCGGCCTAGC CGCACCCGGG AAAGC CGATC CAACAGGCGA CGATGCCGCC TTCCTTGCCG 780 

CGTTGGACCA GGCCGGCATC ACCTACGCTG ACCCAGGCCA CGCCATAACG GCCGCCAAGG 840 

CGATGTGTGG GCTGTGTGCT AACGGCGTAA CAGGTCTACA GCTGGTCGCG GACCTGCGGG 900 

AATACAATCC CGGGCTGACC ATGGACAGCG CGGCCAAGTT CGCTGCCATC GCATCAGGCG 960 

CGTACTGCCC CGAACACCTG GAACA 985 
(2) INFORMATION FOR SEQ ID NO: 18 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60 

CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGC CGGC 120 

ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGC GACGA TAATGGCTAT 180 

AGCACTAAGG AGGATGATCC GATATGACGC AGT CGCAGAC CGTGACGGTG GATCAGCAAG 240 

AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 300 

CCATCACACC GTGCGAACTC ACGGCGGCTA AAAACGCCGC CCAACAGCTG GTATTGTCCG 360 

CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 42 0 

CGCTGCGCAA CGCGGC CAAG GCGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 48 0 

ACAACGACGG CGAAGGAAC T GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT 540 

CGGCCGAACT AAC CGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 600 

TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTTTG 66 0 
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CGGATGGGTG GAACACTTTC AACCTGACGC TGCAAGGCGA CGTCAAGCGG TTCCGGGGGT 720 

TTGACAACTG GGAAGGCGAT GCGGCTACCG CTTGCGAGGC TTCGCTCGAT CAACAACGGC 78 0 

AATGGATACT CCACATGGCC AAATTGAGCG CTGCGATGGC CAAGCAGGCT CAATATGTCG 84 0 

CGCAGCTGCA CGTGTGGGCT AGGCGGGAAC ATCCGACTTA TGAAGACATA GTCGGGCTCG 900 

AACGGCTTTA CGCGGAAAAC CCTTCGGCCC GCGACCAAAT TCTCCCGGTG TACGCGGAGT 960 

ATCAGCAGAG GTCGGAGAAG GTGCTGACCG AATACAACAA CAAGGCAGCC CTGGAACCGG 1020 

TAAACCCGCC GAAGCCTCCC CCCGCCATCA AGATCGACCC GCCCCCGCCT CCGCAAGAGC 108 0 

AGGGATTGAT CCCTGGCTTC CTGATGC CGC CGTCTGACGG CTCCGGTGTG ACTC CCGGTA 1140 

CCGGGATGCC AGCCGCACCG ATGGTTCCGC CTACCGGATC GCCGGGTGGT GGCCTCCCGG 1200 

CTGACACGGC GGCGCAGCTG ACGTCGGCTG GGCGGGAAGC CGCAGCGCTG TCGGGCGACG 126 0 

TGGCGGTCAA AGCGGCATCG CTCGGTGGCG GTGGAGGCGG CGGGGTGCCG TCGGCGCCGT 132 0 

TGGGATCCGC GATCGGGGGC GCCGAATCGG TGCGGCCCGC TGGCGCTGGT GACATTGCCG 1380 

GCTTAGGCCA GGGAAGGGCC GGCGGCGGCG CCGCGCTGGG CGGCGGTGGC ATGGGAATGC 1440 

CGATGGGTGC CGCGCATCAG GGACAAGGGG GCGCCAAGTC CAAGGGTTCT CAGCAGGAAG 1500 

ACGAGGCGCT CTACACCGAG GATCGGGCAT GGACCGAGGC CGTCATTGGT AACCGTCGGC 1560 

GCCAGGACAG TAAGGAGTCG AAGTGAGCAT GGACGAATTG GACCCGCATG TCGCCCGGGC 162 0 

GTTGACGCTG GCGGCGCGGT TTCAGTCGGC CCTAGACGGG ACGCTCAATC AGATGAACAA 1680 

CGGATCCTTC CGCGCCACCG ACGAAGCCGA GACCGTCGAA GTGACGATCA ATGGGCACCA 1740 

GTGGCTCACC GGCCTGCGCA TCGAAGATGG TTTGCTGAAG AAGCTGGGTG CCGAGGCGGT 1800 

GGCTCAGCGG GTCAACGAGG CGCTGCACAA TGCGCAGGCC GCGGCGTCCG CGTATAACGA 1860 

CGCGGCGGGC GAGCAGCTGA CCGCTGCGTT ATCGGCCATG TCCCGCGCGA TGAACGAAGG 192 0 

AATGGCCTAA GCCCATTGTT GCGGTGGTAG CGACTACGCA CCGAATGAGC GCCGCAATGC 1980 

GGTCATTCAG CGCGCCCGAC ACGGCGTGAG TACGCATTGT CAATGTTTTG ACATGGATCG 2040 

GCCGGGTTCG GAGGGCGCCA TAGTCCTGGT CGCCAATATT GCCGCAGCTA GCTGGTCTTA 2100 

GGTTCGGTTA CGCTGGTTAA TTATGACGTC CGTTACCA 2138 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

Met Thr Gin Ser Gin Thr Val Thr Val Asp Gin Gin Slu lie Leu Asn 
15 10 15 

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val 
20 25 30 

Pro lie Thr Pro Cys Glu Leu Thr Ala Ala Lys Asn Ala Ala Gin Gin 
35 40 45 

Leu Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala 
50 55 60 

Lys Glu Arg Gin Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Ala 
65 70 75 80 

Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly 



Glu Gly Thr Val Gin Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser 
100 105 110 

Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro 
115 120 125 

Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp 
130 135 140 

Gin Gly Ala Ser Leu Ala His Phe Ala Asp Gly Trp Asn Thr Phe Asn 
145 150 155 ISO 

Leu Thr Leu Gin Gly Asp Val Lys Arg Phe Arg Gly Phe Asp Asn Trp 
165 170 175 

Glu Gly Asp Ala Ala Thr Ala Cys Glu Ala Ser Leu Asp Gin Gin Arg 
180 185 190 

Gin Trp lie Leu His Met Ala Lys Leu Ser Ala Ala Met Ala Lys Gin 
195 200 205 

Ala Gin Tyr Val Ala Gin Leu His Val Trp Ala Arg Arg Glu His Pro 
210 215 220 

Thr Tyr Glu Asp lie Val Gly Leu Glu Arg Leu Tyr Ala Glu Asn Pro 
225 230 235 240 

Ser Ala Arg Asp Gin lie Leu Pro Val Tyr Ala Glu Tyr Gin Gin Arg 
245 250 255 

Ser Glu Lys Val Leu Thr Glu Tyr Asn Asn Lys Ala Ala Leu Glu Pro 
260 265 270 
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Val Asn Pro Pro Lys Pro Pro Pro Ala lie Lys lie Asp Pro Pro Pro 
275 280 285 

Pro Pro Gin Glu Gin Gly Leu lie Pro Gly Phe Leu Met Pro Pro Ser 
290 295 300 - 

Asp Gly Ser Gly Val Thr Pro Gly Thr Gly Met Pro Ala Ala Pro Met 
305 310 315 320 

Val Pro Pro Thr Gly Ser Pro Gly Gly Gly Leu Pro Ala Asp Thr Ala 
325 330 335 

Ala Gin Leu Thr Ser Ala Gly Arg Glu Ala Ala Ala Leu Ser Gly Asp 
340 345 350' 

Val Ala Val Lys Ala Ala Ser Leu Gly Gly Gly Gly Gly Gly Gly Val 
355 360 365 

Pro Ser Ala Pro Leu Gly Ser Ala lie Gly Gly Ala Glu Ser Val Arg 
370 375 380 

Pro Ala Gly Ala Gly Asp lie Ala Gly Leu Gly Gin Gly Arg Ala Gly 
385 390 395 400 

Gly Gly Ala Ala Leu Gly Gly Gly Gly Met Gly Met Pro Met Gly Ala 
405 410 415 

Ala His Gin Gly Gin Gly Gly Ala Lys Ser Lys Gly Ser Gin Gin Glu 
420 425 430 

Asp Glu Ala Leu Tyr Thr Glu Asp Arg Ala Trp Thr Glu Ala Val lie 
435 440 445 

Gly Asn Arg Arg Arg Gin Asp Ser Lys Glu Ser Lys 
450 455 460 

INFORMATION FOR SEQ ID NO: 18 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 5: 

Ala Gly Asn Val Thr Ser Ala Ser Gly Pro His Arg Phe Gly Ala Pro 



Asp Arg Gly Ser Gin Arg Arg Arg Arg His Pro Ala Ala Ser Thr Ala 



Thr Glu Arg Cys Arg Phe Asp Arg His Val Ala Arg Gin Arg Cys Gly 
35 40 45 
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Phe Pro Pro Ser Arg Arg Gin Leu Arg Arg Arg Val Ser Arg Glu Ala 
50 55 60 

Thr Thr Arg Arg Ser Gly Arg Arg Asn His Arg Cys Gly Trp His Pro 
65 70 75 - 80 

Gly Thr Gly Ser His Thr Gly Ala Val Arg Arg Arg His Gin Glu Ala 
85 90 95 

Arg Asp Gin Ser Leu Leu Leu Arg Arg Arg Gly Arg Val Asp Leu Asp 
100 105 110 

Gly Gly Gly Arg Leu Arg Arg Val Tyr Arg Phe Gin Gly Cys Leu Val 
115 120 125 

Val Val Phe Gly Gin His Leu Leu Arg Pro Leu Leu lie Leu Arg Val 
130 135 140 

His Arg Glu Asn Leu Val Ala Gly Arg Arg Val Phe Arg Val Lys Pro 
145 150 155 160 

Phe Glu Pro Asp Tyr Val Phe lie Ser Arg Met Phe Pro Pro Ser Pro 
165 170 175 

His Val Gin Leu Arg Asp lie Leu Ser Leu Leu Gly His Arg Ser Ala 
180 185 190 

Gin Phe Gly His Val Glu Tyr Pro Leu Pro Leu Leu lie Glu Arg Ser 
195 200 205 

Leu Ala Ser Gly Ser Arg lie Ala Phe Pro Val Val Lys Pro Pro Glu 
210 215 220 

Pro Leu Asp Val Ala Leu Gin Arg Gin Val Glu Ser Val Pro Pro lie 
225 230 235 240 

Arg Lys Val Arg Glu Arg Cys Ala Leu Val Ala Arg Phe Glu Leu Pro 
245 250 255 

Cys Arg Phe Phe Glu lie His Glu Val Gly Phe Thr Gly Arg Gly His 
260 265 270 

Pro Arg Arg lie Gly 
275 

INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
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Arg Val Ala Ala Ser Phe lie Asp Trp Leu Asp Ser Pro Asp Ser Pro 
15 10 15 

Leu Asp Pro Ser Leu Val Ser Ser Leu Leu Asn Ala Val Ser Cys Gly 
20 25 " 30 

Ala Glu Ser Ser Ala Ser Ser Ser Ala Arg Ser Gly Asn Gly Ser Arg 
3 5 40 45 

Trp Thr Ser Met Pro Ser Gly Thr Arg Pro Gly Pro Arg Arg Ala Thr 
50 55 60 

Ser Arg Asp Asp Arg Arg Ser Ala Thr Ser Val lie Pro Ser Arg Arg 
65 70 75 80 

Ser Val Ala Pro Arg Ala Glu Phe Gly Thr Arg Leu Ala Ser His Arg 



Ala Ser Pro Ser Asn Ala Cys Pro Val Arg lie Val Thr Ser Ala Ser 
100 105 110 

Gly Arg Pro lie Ser Ser Pro Pro lie Val Arg Ser Arg Ser Cys Val 
115 120 125 

Asp Lys Asn Gly Arg Arg Cys Ala Ser Gly Tyr Arg Arg Leu Asn Arg 
130 135 140 

Ala Arg Ser Ser Ser lie Ala Ala Arg Cys Arg Thr He Gly Thr Phe 
145 150 155 160 

Arg Arg Ser Arg Tyr Ser Ala Ser Met Arg Val Ser Thr Asn Ser Pro 
165 170 175 

His Val Thr His Gly Val Ala Pro Gly Val Thr Arg Arg He Gly Gly 
180 185 190 



INFORMATION FOR SEQ ID NO: 18 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

Gin Glu Arg Pro Gin Met Cys Gin Arg Val Ser Glu He Glu Pro Arg 
15 10 15 

Thr Gin Phe Phe Asn Arg Cys Ala Leu Pro His Tyr Trp His Phe Pro 
20 25 30 

Ala Val Ala Val Phe Ser Lys His Ala Ser Leu Asp Glu Leu Ala Pro 
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35 40 45 

Arg Asn Pro Arg Arg Ser Ser Arg Arg Asp Ala Glu Asp Arg Arg Val 
50 55 60 

lie Phe Ala Ala Thr Leu Val Ala Val Asp Pro Pro ieu Arg Gly Ala 
65 70 75 80 

Gly Gly Glu Ala Asp Gin Leu lie Asp Leu Gly Val Cys Arg Arg Gin 
85 90 95 

Ala Gly Arg Val Arg Arg Gly Gin Glu Leu His His Arg His Arg His 
100 105 110 

Gin Gly Ala Ala Pro Asp Leu Arg Arg Arg Arg Arg His Arg Arg Val 
115 120 125 

Gin Gin His Arg Arg Leu Gin Arg Val Arg Gin Leu Arg Arg Tyr Val 
130 135 140 

Gin Thr Ala His His Arg Arg Phe Ala Arg Thr Asp Arg Val Arg His 
145 150 155 160 

His Val Arg Gly Pro Ser Asn His Arg Arg Arg Arg Val Tyr Arg Gly 
165 170 175 

Arg His Ser Gly Ala Gly Gly Cys Pro Ala Gly Gly Ala Gly Ser Val 
180 185 190 

Gly Gly Ser Ala 
195 

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Val Arg Cys Gly Thr Leu Val Pro Val Pro Met Val Glu Phe Leu Thr 
15 10 15 

Ser Thr Asn Ala Pro Ser Leu Pro Ser Ala Tyr Ala Glu Val Asp Lys 
20 25 30 

Leu lie Gly Leu Pro Ala Gly Thr Ala Lys Arg Trp lie Asn Gly Tyr 
35 40 45 

Glu Arg Gly Gly Lys Asp His Pro Pro lie Leu Arg Val Thr Pro Gly 
50 55 60 

Ala Thr Pro Trp Val Thr Trp Gly Glu Phe Val Glu Thr Arg Met Leu 
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Ala Glu Tyr Arg Asp Arg Arg Lys Val Pro lie Val Arg Gin Arg Ala 



Ala lie Glu Glu Leu Arg Ala Arg Phe Asn Leu Arg Tyr Pro Leu Ala 
100 105 110 

His Leu Arg Pro Phe Leu Ser Thr His Glu Arg Asp Leu Thr Met Gly 
115 120 125 

Gly Glu Glu lie Gly Leu Pro Asp Ala Glu Val Thr lie Arg Thr Gly 
130 135 140 

Gin Ala Leu Leu Gly Asp Ala Arg Trp Leu Ala Ser Leu Val Pro Asn 
145 150 155 160 

Ser Ala Arg Gly Ala Thr Leu Arg Arg Leu Gly lie Thr Asp Val Ala 
165 170 175 

Asp Leu Arg Ser Ser Arg Glu Val Ala Arg Arg Gly Pro Gly Arg Val 
180 185 190 

Pro Asp Gly lie Asp Val His Leu Leu Pro Phe Pro Asp Leu Ala Asp 
195 200 205 

Asp Asp Ala Asp Asp Ser Ala Pro His Glu Thr Ala Phe Lys Arg Leu 
210 215 220 

Leu Thr Asn Asp Gly Ser Asn Gly Glu Ser Gly Glu Ser Ser Gin Ser 
225 230 235 240 

lie Asn Asp Ala Ala Thr Arg Tyr Met Thr Asp Glu Tyr Arg Gin Phe 
245 250 255 

Pro Thr Arg Asn Gly Ala Gin Arg Ala Leu His Arg Val Val Thr Leu 
260 265 270 

Leu Ala Ala Gly Arg Pro Val Leu Thr His Cys Phe Ala Gly Lys Asp 
275 280 285 

Arg Thr Gly Phe Val Val Ala Leu Val Leu Glu Ala Val Gly Leu Asp 

290 295 300 

Arg Asp Val lie Val Ala Asp 
305 310 

INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

CTCGTGCCGA TTCGGCACGA GCTGAGCAGC CCAAGGGGCC GTTCGGCGAA GTCATCGAGG 60 

CATTCGCCGA CGGGCTGGCC GGCAAGGGTA AGCAAATCAA CACCACGCTG AACAGCCTGT 120 

CGCAGGCGTT GAACGCCTTG AATGAGGGCC GCGGCGACTT CTTCGCGGTG GTACGCAGCC 180 

TGGCGCTATT CGTCAACGCG CTACATCAGG ACGACCAACA GTTCGTCGCG TTGAACAAGA 240 

ACCTTGCGGA GTTCACCGAC AGGTTGACCC ACTCCGATGC GGACCTGTCG AACGCCATCC 300 

AGCAATTCGA CAGCTTGCTC GCCGTCGCGC GCCCGTTCTT CGCCAAGAAC CGCGAGGTGC 360 

TGACGCATGA CGTCAATAAT CTCGCGACCG TGACCACCAC GTTGCTGCAG CCCGATCCGT 420 

TGGATGGGTT GGAGACCGTC CTGCACATCT TCCCGACGCT GGCGGCGAAC ATTAACCAGC 480 

TTTACCATCC GACACACGGT GGCGTGGTGT CGCTTTCCGC GTTCACGAAT TTCGCCAACC 540 

CGATGGAGTT CATCTGCAGC TCGATTCAGG CGGGTAGCCG GCTCGGTTAT CAAGAGTCGG 600 

CCGAACTCTG TGCGCAGTAT CTGGCGCCAG TCCTCGATGC GATCAAGTTC AACTACTTTC 660 

CGTTCGGCCT GAACGTGGCC AGCACCGCCT CGACACTGCC TAAAGAGATC GCGTACTCCG 720 

AGCCCCGCTT GCAGCCGCCC AACGGGTACA AGGACACCAC GGTGCCCGGC ATCTGGGTGC 780 

CGGATACGCC GTTGTCACAC CGCAACACGC AGCCCGGTTG GGTGGTGGCA CCCGGGATGC 840 

AAGGGGTTCA GGTGGGACCG ATCACGCAGG GTTTGCTGAC GCCGGAGTCC CTGGCCGAAC 900 

TCATGGGTGG TCCCGATATC GCCCCTCCGT CGTCAGGGCT GCAAACCCCG CCCGGACCCC 960 

CGAATGCGTA CGACGAGTAC CCCGTGCTGC CGCCGATCGG TTTACAGGCC CCACAGGTGC 1020 

CGATACCACC GCCGCCTCCT GGGCCCGACG TAATCCCGGG TCCGGTGCCA CCGGTCTTGG 1080 

CGGCGATCGT GTTCCCAAGA GATCGCCCGG CAGCGTCGGA AAACTTCGAC TACATGGGCC 1140 

TCTTGTTGCT GTCGC CGGGC CTGGCGACCT TCCTGTTCGG GGTGTCATCT AGCCCCGCCC 12 00 

GTGGAACGAT GGCCGATCGG CACGTGTTGA TACCGGCGAT CACCGGCCTG GCGTTGATCG 12 60 

CGGCATTCGT CGCACATTCG TGGTAC CGCA CAGAACATCC GCTCATAGAC ATGCGCTTGT 132 0 

TCCAGAACCG AGCGGTCGCG CAGGC CAAC A TGACGATGAC GGTGCTCTCC CTCGGGCTGT 13 8 0 

TTGGCTCCTT CTTGCTGCTC CCGAGCTACC TCCAGCAAGT GTTGCACCAA TCACCGATGC 1440 

AATCGGGGGT GC ATAT CAT C CCACAGGGCC TCGGTGCCAT GCTGGCGATG CCGATCGCCG 1500 

GAGCGATGAT GGACCGACGG GGACCGGCCA AGATCGTGCT GGTTGGGATC ATGCTGATCG 1560 

CTGCGGGGTT GGGCACCTTC GCCTTTGGTG TCGCGCGGCA AGCGGACTAC TTACCCATTC 162 0 
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TGCCGACCGG GCTGGCAATC ATGGGCATGG GCATGGGCTG CTCCATGATG CCACTGTCCG 1680 

GGGCGGCAGT GCAGACCCTG GCCCCACATC AGATCGCTCG CGGTTCGACG CTGATCAGCG 1740 

TCAAC CAGCA GGTGGGCGGT TCGATAGGGA CCGCACTGAT GTCGGTGCTG CTCACCTACC 1800 

AGTTCAATCA CAGCGAAATC ATCGCTACTG CAAAGAAAGT CGCACTGACC CCAGAGAGTG I860 

GCGCCGGGCG GGGGGCGGCG GTTGACCCTT CCTCGCTACC GCGCCAAACC AACTTCGCGG 1920 

CCCAACTGCT GCATGACCTT TCGCACGCCT ACGCGGTGGT ATTCGTGATA GCGACCGCGC 1980 

TAGTGGTCTC GACGCTGATC CCCGCGGCAT TCCTGCCGAA AC AGCAGG CT AGTCATCGAA 2040 

GAGCACCGTT GCTATCCGCA TGACGTCTGC TT 2072 
(2) INFORMATION FOR SEQ ID NO: 19 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 base pairs 

(B) TYPE: nucleic acid. 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

TCACCCCGGA GAAGTCGTTC GTCGACGACC TGGACATCGA CTCGCTGTCG ATGGTCGAGA 60 

TCGCCGTGCA GACCGAGGAC AAGTACGGCG TCAAGATCCC CGACGAGGAC CTCGCCGGTC 12 0 

TGCGTACCGT CGGTGACGTT GTCGCCTACA TCCAGAAGCT CGAGGAAGAA AACCCGGAGG 180 

CGGCTCAGGC GTTGCGCGCG AAGATTGAGT CGGAGAACCC CGATGCGGCA CGAGCAGATC 240 

GGTGCGTTTC ACCCACATCG CAAGCTCGAG ACGCCCGTCG TCCTCTTGCA CGCTCAGCCA 300 

GGTTGGCGTG TCGCCGCCTT CCAGCAAGTG TTCCCACCAC ACGAAGGGAC CCTCGCGAAA 360 

GGTGACTGAT CCGCGGACCA CATAGTCGAT GCCACCGTGG CTGACAATTG CGCCGGGTCC 42 0 

GAGTTGGCGG GGGCCGAATT GCGGCATTGC GTCGAAGGCC AGCGGATCCC GGCGCCCGCC 480 

CGGCGTGGCT GGTGTTTTGG GCCGCCGGAT GGCCACGACG AGAACGACGA TGGCGGCGAT 540 

GAACAGCGCC ACGGCAATCA CGACCAGCAG ATTTCCCACG CATACCCTCT CGTACCGCTG 600 

CGCCGCGGTT GGTCGATCGG TCGCATATCG ATGGCGCCGT TTAACGTAAC AGCTTTCGCG 66 0 

GGACCGGGGG TCACAACGGG CGAGTTGTCC GGCCGGGAAC CCGGCAGGTC TCGGCCGCGG 72 0 

TCACCCCAGC TCACTGGTGC ACCATCCGGG TGTCGGTGAG CGTGCAACTC AAACACACTC 78 0 

AACGGCAACG GTTTCTCAGG TCACCAGCTC AACCTCGACC CGCAATCGCT CGTACGTTTC 84 0 
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GACCGCGCGC AGGTCGCGAG TCAGCAGCTT TGCGCCGGCA GCTTTCGCCG TGAAGCCGAC 900 

CAGGGCATCG TAGGTTGCGC CACCGGTGAC ATCGTGCTCG GCGAGGTGGT CGGTCAAGCC 960 

GCGATATGAG CAGGCATCCA GTGCCAGGTA GTTGCTGGAG GTGATGTCCG CCAAGTAGGC 1020 

GTGGACGGCA ACAGGGGCAA TACGATGCGG CGGTGGTAGC CGGGTCAAGA CCGAATAGGT 1080 

TTCCACAGCC GCGTGCGCGA TCAGATGGAC GCCACGGTTG AGCGCGCGCA CGGCGGCCTC 1140 

GTGCCCTTCG TGCCAGGTCG CGAATCCGGC AACCAGCACG CTGGTGTCTG GTGCGATCAC 1200 

CGCCGTGTGC GATCGAGCGT TTCCCGAACG ATTTCGTCGG TCAACGGGGG CAGGGGACGT 1260 

TCTGGCCGTG CGACGAGAAC CGAGCCTTCC CGAACGAGTT CGACACCGGT CGGGGCCGGC 1320 

TCAATCT CGA TGCGCCCATC GCGCTCGGTG ATCTCCACCT GGTCGTTCCC GCGCAAGCCA 138 0 

AGGCGCTCGC GAATCCGCTT GGGAATCACC AGACGTCCTG CGACATCGAT GGTTGTTCGC 1440 

ATGGTAGGAA ATTTACCATC GCACGTTCCA TAGGCGTGTC CTGCG CGGGA TGTCGGGACG 1500 

ATCCGCTAGC GTATCGAACG ATTGTTTCGG AAATGGCTGA GGGAGCGTGC GGTGCGGGTG 1560 

ATGGGTGTCG ATCCCGGGTT GACCCGATGC GGGCTGTCGC TCATCGAGAG TGGGCGTGGT 162 0 

CGGCAGCTCA CCGCGCTGGA TGTCGACGTG GTGCGCACAC CGTCGGATGC GGCCTTGGCG 1680 

CAGCGCCTGT TGGCCATCAG CGATGCCGTC GAGCACTGGC TGGACACCCA TCATCCGGAG 174 0 

GTGGTGGCTA TCGAACGGGT GTTCTCTCAG CTCAACGTGA CCACGGTGAT GGGCACCGCG 1800 

CAGGCCGGCG GCGTGATCGC CCTGGCGGCG GCCAAACGTG GTGTCGACGT GCATTTCCAT 1860 

ACCCCCAGCG AGGTCAAGGC GGCGGTCACT GGCAACGGTT CCGCAGACAA GGCTCAGGTC 1920 

ACC 1923 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

CTGGCGTGCC AGTGTCACCG GCGATATGAC GTCGGCATTC AATTTCGCGG CCCCGCCGGA 60 

CCCGTCGCCA CCCAATCTGG ACCACCCGGT CCGTCAATTG CCGAAGGTCG CCAAGTGCGT 120 

GCCCAATGTG GTGCTGGGTT TCTTGAACGA AGGCCTGCCG TATCGGGTGC CCTACCCCCA 18 0 

AACAACGCCA GTC CAGGAAT CCGGTCCCGC GCGGCCGATT CCCAGCGGCA TCTGCTAGCC 24 0 
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GGGGATGGTT CAGACGTAAC GGTTGGCTAG GTCGAAACCC GCGCCAGGGC CGCTGGACGG 3 00 

GCTCATGGCA GCGAAATTAG AAAACCCGGG ATATTGTCCG CGGATTGTCA TACGATGCTG 360 

AGTGCTTGGT GGTTCGTGTT TAGCCATTGA GTGTGGATGT GTTGAGACCC TGGCCTGGAA 420 

GGGGACAACG TGCTTTTGCC TCTTGGTCCG CCTTTGCCGC CCGACGCGGT GGTGGCGAAA 4 80 

CGGGCTGAGT CGGGAATGCT CGGCGGGTTG TCGGTTCCGC TCAGCTGGGG AGTGGCTGTG 540 

CCACCCGATG ATTATGACCA CTGGGCGCCT GCGCCGGAGG ACGGCGCCGA TGTCGATGTC 6 00 

CAGGCGGCCG AAGGGGCGGA CGCAGAGGCC GCGGCCATGG ACGAGTGGGA TGAGTGGCAG 6 60 

GCGTGGAACG AGTGGGTGGC GGAGAACGCT GAACCCCGCT TTGAGGTGCC ACGGAGTAGC 720 

AGCAGCGTGA TTCCGCATTC TCCGGCGGCC GGCTAGGAGA GGGGGCGCAG ACTGTCGTTA 780 

TTTGACCAGT GATCGGCGGT CTCGGTGTTC CCGCGGCCGG CTATGACAAC AGTCAATGTG 84 0 

CATGACAAGT TACAGGTATT AGGTCCAGGT TCAACAAGGA GACAGGCAAC ATGGCAACAC 90 0 

GTTTTATGAC GGATC CGCAC GCGATGCGGG ACATGGCGGG CCGTTTTGAG GTGCACGCCC 960 

AGACGGTGGA GGACGAGGCT CGCCGGATGT GGGCGTCCGC GCAAAACATC TCGGGNGCGG 102 0 

GCTGGAGTGG CATGGCCGAG GCGACCTCGC TAGAC 1055 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

CCGCCTCGTT GTTGGCATAC TCCGCCGCGG CCGCCTCGAC CGCACTGGCC GTGGCGTGTG 60 

TCCGGGCTGA CCACCGGGAT CGCCGAACCA TCCGAGATCA CCTCGCAATG ATCCACCTCG 120 

CGCAGCTGGT CACCCAGCCA CCGGGCGGTG TGCGACAGCG CCTGCATCAC CTTGGTATAG 180 

CCGTCGCGCC CCAGCCGCAG GAAGTTGTAG TACTGGCCCA CCACCTGGTT ACCGGGACGG 24 0 

GAGAAGTTCA GGGTGAAGGT CGGCATGTCG CCGCCGAGGT AGTTGACCCG GAAAACCAGA 300 

TCCTCCGGCA GGTGCTCGGG CCCGCGCCAC ACGACAAACC CGACGCCGGG ATAGGTCAG 359 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

AACGGGCCCG TGGGCACCGC TCCTCTAAGG GCTCTCGTTG GTCGCATGAA GTGCTGGAAG 60 

GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC TAGTCCTAGT CCGAGTCGCC 120 

CGCAAAGTTC CTCGAATAAC TCCGTACCCG GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC 180 

TGCGCGAACC ACTTGAGGTT CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC 240 

TGATCGGTTC GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 300 

GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 350 

(2) INFORMATION FOR SEQ ID NO : 194 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 79 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Glu Gin Pro Lys Gly Pro Phe Gly Glu Val lie Glu Ala Phe Ala Asp 
1 5 10 15 

Gly Leu Ala Gly Lys Gly Lys Gin lie Asn Thr Thr Leu Asn Ser Leu 
20 25 30 

Ser Gin Ala Leu Asn Ala Leu Asn Glu Gly Arg Gly Asp Phe Phe Ala 
35 40 45 

Val Val Arg Ser Leu Ala Leu Phe Val Asn Ala Leu His Gin Asp Asp 
50 55 60 

Gin Gin Phe Val Ala Leu Asn Lys Asn Leu Ala Glu Phe Thr Asp Arg 
65 70 75 80 

Leu Thr His Ser Asp Ala Asp Leu Ser Asn Ala lie Gin Gin Phe Asp 
85 90 95 

Ser Leu Leu Ala Val Ala Arg Pro Phe Phe Ala Lys Asn Arg Glu Val 
100 105 110 

Leu Thr His Asp Val Asn Asn Leu Ala Thr Val Thr Thr Thr Leu Leu 
115 120 125 

Gin Pro Asp Pro Leu Asp Gly Leu Glu Thr Val Leu His lie Phe Pro 
130 135 140 
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Thr Leu Ala Ala Asn lie Asn Gin Leu Tyr His Pro Thr His Gly Gly 
145 150 155 160 

Val Val Ser Leu Ser Ala Phe Thr Asn Phe Ala Asn Pro Met Glu Phe 
165 170 175 

He Cys Ser Ser He Gin Ala Gly Ser Arg Leu Gly Tyr Gin Glu Ser 
180 185 190 

Ala Glu Leu Cys Ala Gin Tyr Leu Ala Pro Val Leu Asp Ala He Lys 
195 200 205 

Phe Asn Tyr Phe Pro Phe Gly Leu Asn Val Ala Ser Thr Ala Ser Thr 
210 215 220 

Leu Pro Lys Glu He Ala Tyr Ser Glu Pro Arg Leu Gin Pro Pro Asn 
225 230 235 240 

Gly Tyr Lys Asp Thr Thr Val Pro Gly He Trp Val Pro Asp Thr Pro 
245 250 255 

Leu Ser His Arg Asn Thr Gin Pro Gly Trp Val Val Ala Pro Gly Met 
260 265 270 

Gin Gly Val Gin Val Gly Pro He Thr Gin Gly Leu Leu Thr Pro Glu 

275 280 285 

Ser Leu Ala Glu Leu Met Gly Gly Pro Asp He Ala Pro Pro Ser Ser 
290 295 300 

Gly Leu Gin Thr Pro Pro Gly Pro Pro Asn Ala Tyr Asp Glu Tyr Pro 
305 310 315 320 

Val Leu Pro Pro He Gly Leu Gin Ala Pro Gin Val Pro He Pro Pro 
325 330 335 

Pro Pro Pro Gly Pro Asp Val He Pro Gly Pro Val Pro Pro Val Leu 
340 345 350 

Ala Ala He Val Phe Pro Arg Asp Arg Pro Ala Ala Ser Glu Asn Phe 
355 360 365 

Asp Tyr Met Gly Leu Leu Leu Leu Ser Pro Gly Leu Ala Thr Phe Leu 
370 375 380 

Phe Gly Val Ser Ser Ser Pro Ala Arg Gly Thr Met Ala Asp Arg His 
385 390 395 400 

Val Leu He Pro Ala He Thr Gly Leu Ala Leu He Ala Ala Phe Val 
405 410 415 

Ala His Ser Trp Tyr Arg Thr Glu His Pro Leu He Asp Met Arg Leu 
420 425 430 

Phe Gin Asn Arg Ala Val Ala Gin Ala Asn Met Thr Met Thr Val Leu 



mm 
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Ser Leu Gly Leu Phe Gly Ser Phe Leu Leu Leu Pro Ser Tyr Leu Gin 
450 455 460 

Gin Val Leu His Gin Ser Pro Met Gin Ser Gly Val 'His lie lie Pro 
465 470 475 480 

Gin Gly Leu Gly Ala Met Leu Ala Met Pro lie Ala Gly Ala Met Met 
485 490 495 

Asp Arg Arg Gly Pro Ala Lys lie Val Leu Val Gly lie Met Leu lie 
500 505 510 

Ala Ala Gly Leu Gly Thr Phe Ala Phe Gly Val Ala Arg Gin- Ala Asp 
515 520 525 

Tyr Leu Pro lie Leu Pro Thr Gly Leu Ala lie Met Gly Met Gly Met 
530 535 540 

Gly Cys Ser Met Met Pro Leu Ser Gly Ala Ala Val Gin Thr Leu Ala 
545 550 555 560 

Pro His Gin lie Ala Arg Gly Ser Thr Leu lie Ser Val Asn Gin Gin 
565 570 575 

Val Gly Gly Ser lie Gly Thr Ala Leu Met Ser Val Leu Leu Thr Tyr 
580 585 590 

Gin Phe Asn His Ser Glu lie lie Ala Thr Ala Lys Lys Val Ala Leu 
595 600 605 

Thr Pro Glu Ser Gly Ala Gly Arg Gly Ala Ala Val Asp Pro Ser Ser 
610 615 620 

Leu Pro Arg Gin Thr Asn Phe Ala Ala Gin Leu Leu His Asp Leu Ser 
625 630 635 640 

His Ala Tyr Ala Val Val Phe Val lie Ala Thr Ala Leu Val Val Ser 
645 650 655 

Thr Leu lie Pro Ala Ala Phe Leu Pro Lys Gin Gin Ala Ser His Arg 
660 665 670 

Arg Ala Pro Leu Leu Ser Ala 
675 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

Thr Pro Glu Lys Ser Phe Val Asp Asp Leu Asp lie Asp Ser Leu Ser 
15 10 15 

Met Val Glu lie Ala Val Gin Thr Glu Asp Lys Tyr Sly Val Lys lie 
20 25 30 

Pro Asp Glu Asp Leu Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala 
35 40 45 

Tyr lie Gin Lys Leu Glu Glu Glu Asn Pro Glu Ala Ala Gin Ala Leu 
50 55 60 

Arg Ala Lys lie Glu Ser Glu Asn Pro Asp Ala Ala Arg Ala Asp Arg 
65 70 75 80 

Cys Val Ser Pro Thr Ser Gin Ala Arg Asp Ala Arg Arg Pro Leu Ala 
85 90 95 

Arg Ser Ala Arg Leu Ala Cys Arg Arg Leu Pro Ala Ser Val Pro Thr 
100 105 110 

Thr Arg Arg Asp Pro Arg Glu Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Leu Ala Cys Gin Cys His Arg Arg Tyr Asp Val Gly lie Gin Phe Arg 
15 10 15 

Gly Pro Ala Gly Pro Val Ala Thr Gin Ser Gly Pro Pro Gly Pro Ser 
20 25 30 

lie Ala Glu Gly Arg Gin Val Arg Ala Gin Cys Gly Ala Gly Phe Leu 
35 40 45 

Glu Arg Arg Pro Ala Val Ser Gly Ala Leu Pro Pro Asn Asn Ala Ser 
50 55 60 

Pro Gly lie Arg Ser Arg Ala Ala Asp Ser Gin Arg His Leu Leu Ala 



Gly Asp Gly Ser Asp Val Thr Val Gly 
85 



(2) INFORMATION FOR SEQ ID NO: 197: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

Ala Ser Leu Leu Ala Tyr Ser Ala Ala Ala Ala Ser Thr Ala Leu Ala 
15 10 15 

Val Ala Cys Val Arg Ala Asp His Arg Asp Arg Arg Thr lie Arg Asp 
20 25 30 

His Leu Ala Met lie His Leu Ala Gin Leu Val Thr Gin Pro Pro Gly 
35 40 45 

Gly Val Arg Gin Arg Leu His His Leu Gly lie Ala Val Ala Pro Gin 
50 55 60 

Pro Gin Glu Val Val Val Leu Ala His His Leu Val Thr Gly Thr Gly 
65 70 75 80 

Glu Val Gin Gly Glu Gly Arg His Val Ala Ala Glu Val Val Asp Pro 
85 90 95 

Glu Asn Gin lie Leu Arg Gin Val Leu Gly Pro Ala Pro His Asp Lys 
100 105 110 

Pro Asp Ala Gly lie Gly Gin 
115 

INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Arg Ala Arg Gly His Arg Ser Ser Lys Gly Ser Arg Trp Ser His Glu 
15 10 15 

Val Leu Glu Gly Cys lie Leu Ala Asp Ser Arg Gin Ser Lys Thr Ala 
20 25 30 

Ala Ser Pro Ser Pro Ser Arg Pro Gin Ser Ser Ser Asn Asn Ser Val 



Pro Gly Ala Pro Asn Arg Val Ser Phe Ala Lys Leu Arg Glu Pro Leu 
50 55 60 
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Glu Val Pro Gly Leu Leu Asp Val Gin Thr Asp Ser Phe Glu Trp Leu 
65 70 75 80 

lie Gly Ser Pro Arg Trp Arg Glu Ser Ala Ala Glu Arg Gly Asp Val 
85 90 95 

Asn Pro Val Gly Gly Leu Glu Glu Val Leu Tyr Glu Leu Ser Pro lie 
100 105 110 

Glu Asp Phe Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

TGCTACGCAG CAATCGCTTT GGTGACAGAT GTGGATGCCG GCGTCGCTGC TGGCGATGGC 60 

GTGAAAGCCG CCGACGTGTT CGCCGCATTC GGGGAGAACA TCGAACTGCT CAAAAGGCTG 120 

GTGCGGGCCG CCATCGATCG GGTCGCCGAC GAGCGC AC GT GCACGCACTG TCAACACCAC 180 

GCCGGTGTTC CGTTGCCGTT CGAGCTGCCA TGAGGGTGCT GCTGACCGGC GCGGC CGGCT 240 

TCATCGGGTC GCGCGTGGAT GCGGCGTTAC GGGCTGCGGG TCACGACGTG GTGGG CGTCG 3 00 

ACGCGCTGCT GCCCGCCGCG CACGGGCCAA ACCCGGTGCT GCCACCGGGC TGCCAGCGGG 360 

TCGACGTGCG CGACGCCAGC GCGCTGGCCC CGTTGTTGGC CGGTGTCGAT CTGGTGTGTC 420 

ACCAGGCCGC CATGGTGGGT GCCGGCGTCA ACGCCGCCGA CGCACCCGCC TATGGCGGCC 480 

ACAACGATTT CGCCACCACG GTGCTGCTGG CGCAGATGTT CGCCGCCGGG GTCCGCCGTT 540 

TGGTGCTGGC GTCGTCGATG GTGGTTTACG GGCAGGGGCG CTATGACTGT CCCCAGCATG 60 0 

GACCGGTCGA CCCGCTGCCG CGGCGGCGAG CCGACCTGGA CAATGGGGTC TTCGAGCACC 66 0 

GTTGCCCGGG GTGCGGCGAG CCAGTCATCT GGCAATTGGT CGACGAAGAT GCCCCGTTGC 72 0 

GCCCGCGCAG CCTGTACGCG GCAGCAAGAC CGCGCAGGAG CACTACGCGC TGGCGTGGTC 78 0 

GGAAACGAAT GGCGGTTCCG TGGTGGCGTT G 811 
(2) INFORMATION FOR SEQ ID NO:200: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

GTCCCGCGAT GTGGCCGAGC ATGACTTTCG GCAACACCGG CGTAGTAGTC GAAGATATCG 60 

GACTTTGTGG TCCCGGTGGC GGGATAGAGC ACCTGTCGGC GTTGGTCAGC GTCACCCGTT 12 0 

GCTCGGACGC CGAACCCATG CTTTCAACGT AGCCTGTCGG TCACACAAGT CGCGAGCGTA 18 0 

ACGTCACGGT CAAATATCGC GTGGAATTTC GCCGTGACGT TCCGCTCGCG GACAATCAAG 240 

GCATACTCAC TTACATG CGA GCCATTTGGA CGGGTTCGAT CGCCTTCGGG CTGGTGAACG 300 

TGCCGGTCAA GGTGTACAGC GCTACCGCAG ACCACGACAT CAGGTTC C AC CAGGTGCACG 360 

CCAAGGACAA CGGACGCATC CGGTACAAGC GCGTCTGCGA GGCGTGTGGC GAGGTGGTCG 420 

ACTACCGCGA TCTTGCCCGG GCCTACGAGT CCGGCGACGG CCAAATGGTG GCGATCACCG 480 

ACGACGACAT CGCCAGCTTG CCTGAAGAAC GCAGCCGGGA GATCGAGGTG TTGGAGTTCG 540 

TCCCCGCCGC CGACGTGGAC CCGATGATGT TCGACCGCAG CTACTTTTTG GAGC CTGATT 600 

CGAAGTCGTC GAAATCGTAT GTGCTGCTGG CTAAGACACT CGCCGAGACC GACCGGATGG 660 

CGATCGTGGA TCGCCCCACC GGCCGTGAAT GCAGGAAAAA TAAGAGCCGC TATCCACAAT 72 0 

TCGGCGTCGA GCTCGGCTAC CACAAACGGT AGAACGATCG AGACATTCCC GAGCTGAAGT 780 

GCGGCGCTAT AGAAGCCGCT CTGCGCGATT ATCAAACGCA AAATACGCTT ACTCATGCCA 84 0 

TCGGCGCTGC TCACCCGATG CGACGTTTTT GCCACGCTCC ACCGCCTGCC GCGCGACCTC 900 

AAGTGGGCAT GCATCCCACC CGTTCCCGGA AACCGGTTCC GGCGGGTCGG CTCATCGCTT 960 

CATCCT 966 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 36 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 

CCGCACCGCC GGCAATACCG CCAGCGCCAC CGTTACCGCC GTTTGCGCCG TTGCCCCCGT 60 

TGCCGCCCGT CCCGCCGGCC CCGCCGATGG AGTTCTCATC GCCAAAAGTA CTGGCGTTGC 12 0 

CACCGGAGCC GCCGTTGCCG CCGTCACCGC CAGCCCCGCC GACTCCACCG GCCCCACCGA 180 



191 



CTCCGCCGCT 


GCCACCGTTG 


CCGCCGTTGC 


CGATCAACAT GCCGCTGGCG CCACCCTTGC 




CACCCACGCC 


ACCGGCTCCG 


CCCACCCCGC 


CGACACCAAG CGAGCTGCCG CCGGAGCCAC 




CATCACCACC 


TACGCCACCG 


ACCGCCCAGA 


CACCAGCGAC CGGGTCTTCG TGAAACGTCG 




CGGTGCCACC 


ACCGCCGCCG 


TTACCGCCAA 


CCCCACCGGC AACGCCGGCG CCGCCATCCC 




CGCCGGCCCC 


GGCGTTGCCG 


CCGTTGCCGC 


CGTTGCCGAA CAACAACCCG CCGGCGCCGC 




CGTTGCCGCC 


CGCGCCGCCG 


GTCCCGCCGG 


CGCCGCCGAC GCCAAGGCCG CTGCCGCCCT 




TGCCGCCATC 


ACCACCCTTG 


CCGCCGACCA 


CATCGGGTTC TGCCTCGGGG TCTGGGCTGT 




CAAACCTCGC 


GATGCCAGCG 


TTGCCGCCGC 


TTCCCCCGGG CCCCCCCGTG GCGCCGTCAC 




CACCGATACC 


ACCCGCGCCA 


CCGGCGCCAC 


CGTTGCCGCC ATC AC CGAAT AGCAACCCGC 




CGGCGCCACC 


ATTGCCGCCA 


GCTCCCCCTG 


CGCCACCGTC GGCGCCGGAG GCGGCACTGG 


780 


CAGCCCCGTT 


ACCACCGAAA 


CCGCCGCTAC 


CACCGGTAGA GGTGGCAGTG GCGATGTGTA 


840 


CGAAAGCGCC 


GCCTCCGGCG 


CCGCCGCTAC 


CACCCCCACT GCCGGCGGCT ACAC CGTCGG 


900 


ACCCGTTGCC 


ACCATCACCG 


CCAAAGGCGC 


TCGCAATGTC GCCCTGCGCG ACTCCGCCGT 


960 


CGCCGCCGTT 


GCCGCCGCCG 


CCACCGGCAG 


CGGCGGTACC GCCGTCACCA CCGGCACCGC 


1020 


CGGTGGCCTT 


GCCCGAGCCT 


GCCGTCGCGG 


TGGCACCGTC GCCGCCGGTG CCACCGGTCG 


1080 


GCGTGCCGGC 


AGTGCCATGG 


CCGCCCGTGC 


CGCCGTCGCC GCCGGTTTGA TCACCGATGC 




CGGACACATC 


TGCCGGGCTG 


TCCCCGGTGC 


TGGC CGCGGG GCCGGGCGTG GGATTGACCC 


1200 


CGTTTGCCCC 


GGCGAGGCCG 


GCGCCGCCGG 


TACCACCGGC GCCGCCATGG CCGAACAGCC 


1260 


CGGCGTTGCC 


GCCGTTACCG 


CCCGCACCCC 


CGATGCCTGC GGCCACGCTG GTGCCGCCGA 


1320 


CACCGCCGTT 


GCCGCCGTTG 


CCCCACAACC 


ACCCCCCGTT CCCACCGGCA CCGCCGGCCG 


1380 


CGCCGGTACC 


ACCGGCCCCG 


CCGTTGCCGC 


CGTTGCCGAT CAACCCGGCC GCGCCTCCGC 




TGCCGCCGGT 


TTGACCGAAC 


CCGCCAGCCG 


CGCCGTTGCC ACCGTTGCCA AACAGCAACC 


1500 


CGCCGGCCGC 


GCCAGGCTGC 


CCGGGTGCCG 


TCCCGTCGGC GCCGTTTCCG ATCAACGGGC 


1560 


GCCCCAAAAG 


CGCCTCGGTG 


GGCGCATTCA 


CCGCACCCAG CAGACTCCGC TCAACAGCGG 


1620 


CTTCAGTGCT 


GGCATACCGA 


CCCGCGGCCG 


CAGTCAACGC CTGCACAAAC TGCTCGTGAA 


1680 


ACGCTGCCAC 


CTGTACGCTG 


AGCGCCTGAT 


ACTGCCGAGC ATGGGCCCCG AACAACCCCG 


1740 


CAATCGCCGC 


CGACACTTCA 


TCGGCAGCCG 


CAGCCACCAC TTCCGTCGTC GGGATCGCCG 


1800 


CGGCCGCATT 


AGCCGCGCTC 


ACCTGCGAAC 


CAATAGTCGA TAAATCCAAA GCCGCAGTTG 


1860 
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CCAGCAGCTG 


CGGCGTCGCG 


ATCACCAAGG 


ACACCTCGCA 


CCTCCGGATA 


CCCCATATCG 


1920 


CCGCACCGTG 


TCCCCAGCGG 


CCACGTGACC 


TTTGGTCGCT 


GGCTGGCGGC 


CCTGACTATG 


1980 


GCCGCGACGG 


CCCTCGTTCT 


GATTCGCCCC 


GGCGCGCAGC 


TTGTTGCGCG 


AGTTGAAGAC 


2040 


GGGAGGACAG 


GCCGAGCTTG 


GTGTAGACGT 


GGGTCAAGTG 


GGAATGCACG 


GTCCGCGGCG 


2100 


AGATGAATAG 


GCGGACGCCG 


ATCTCCTTGT 


TGCTGAGTCC 


CTCACCGACC 


AGTAGAGCCA 


2160 


CCTCAAGCTC 


TGTCGGTGTC 


AACGCGCCCC 


AGCCACTTGT 


CGGGCGTTTC 


CGTGCACCGC 


2220 


GGCCTCGTTG 


CGCGTACGCG 


ATCGCCTCAT 


CGATCGATAA 


CGCAGTTCCT 


TCGGCCCAGG 


2280 


CATCGTCGAA 


CTCGCTGTCA 


CCCATGGATT 


TTCGAAGGGT 


GGCTAGCGAC 


GAGTTACAGC 


2340 


CCGCCTGGTA 


GATCCCGAAG 


CGGACCG 








2367 


(2) INFORMATION FOR S 


3Q ID NO: 202: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

Gin Pro Ala Gly Ala Thr lie Ala Ala Ser Ser Pro Cys Ala Thr Val 

15 10 15 

Gly Ala Gly Gly Gly Thr Gly Ser Pro Val Thr Thr Glu Thr Ala Ala 
20 25 30 

Thr Thr Gly Arg Gly Gly Ser Gly Asp Val Tyr Glu Ser Ala Ala Ser 
35 40 45 

Gly Ala Ala Ala Thr Thr Pro Thr Ala Gly Gly Tyr Thr Val Gly Pro 
50 55 60 

Val Ala Thr lie Thr Ala Lys Gly Ala Arg Asn Val Ala Leu Arg Asp 
65 70 75 80 

Ser Ala Val Ala Ala Val Ala Ala Ala Ala Thr Gly Ser Gly Gly Thr 
85 90 95 

Ala Val Thr Thr Gly Thr Ala Gly Gly Leu Ala Arg Ala Cys Arg Arg 
100 105 HO 

Gly Gly Thr Val Ala Ala Gly Ala Thr Gly Arg Arg Ala Gly Ser Ala 
115 120 125 

Met Ala Ala Arg Ala Ala Val Ala Ala Gly Leu lie Thr Asp Ala Gly 
130 135 140 
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His He Cys Arg Ala Val Pro Gly Ala Gly Arg Gly Ala Gly Arg Gly 
145 150 155 160 

He Asp Pro Val Cys Pro Gly Glu Ala Gly Ala Ala Gly Thr Thr Gly 
165 170 - 175 

Ala Ala Met Ala Glu Gin Pro Gly Val Ala Ala Val Thr Ala Arg Thr 
180 185 190 

Pro Asp Ala Cys Gly His Ala Gly Ala Ala Asp Thr Ala Val Ala Ala 
195 200 205 

Val Ala Pro Gin Pro Pro Pro Val Pro Thr Gly Thr Ala Gly Arg Ala 
210 215 220 

Gly Thr Thr Gly Pro Ala Val Ala Ala Val Ala Asp Gin Pro Gly Arg 
225 230 235 240 

Ala Ser Ala Ala Ala Gly Leu Thr Glu Pro Ala Ser Arg Ala Val Ala 
245 250 255 

Thr Val Ala Lys Gin Gin Pro Ala Gly Arg Ala Arg Leu Pro Gly Cys 
260 265 270 

Arg Pro Val Gly Ala Val Ser Asp Gin Arg Ala Pro Gin Lys Arg Leu 
275 280 285 

Gly Gly Arg He His Arg Thr Gin Gin Thr Pro Leu Asn Ser Gly Phe 
290 295 300 

Ser Ala Gly lie Pro Thr Arg Gly Arg Ser Gin Arg Leu His Lys Leu 
305 310 315 320 

Leu Val Lys Arg Cys His Leu Tyr Ala Glu Arg Leu He Leu Pro Ser 
325 330 335 

Met Gly Pro Glu Gin Pro Arg Asn Arg Arg Arg His Phe He Gly Ser 
340 345 350 

Arg Ser His His Phe Arg Arg Arg Asp Arg Arg Gly Arg He Ser Arg 
355 360 365 

Ala His Leu Arg Thr Asn Ser Arg 
370 375 

INFORMATION FOR SEQ ID NO: 2 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2852 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
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GGCCAAAACG CCCCGGCGAT CGCGGCCACC GAGGCCGCCT ACGAC CAGAT GTGGGCCCAG 60 

GACGTGGCGG CGATGTTTGG CTACCATGCC GGGGCTTCGG CGGCCGTCTC GGCGTTGACA 120 

CCGTTCGGCC AGGCGCTGCC GACCGTGGCG GGCGGCGGTG CGCTGGTCAG CGCGGCCGCG 180 

GCTCAGGTGA CCACGCGGGT CTTCCGCAAC CTGGGCTTGG CGAACGTCCG CGAGGGCAAC 240 

GTCCGCAACG GTAATGTCCG GAACTTCAAT CTCGGCTCGG CCAACATCGG CAACGGCAAC 3 00 

ATCGGCAGCG GCAACATCGG CAGCTCCAAC ATCGGGTTTG GCAACGTGGG TCCTGGGTTG 3 60 

ACCGCAGCGC TGAACAACAT CGGTTTCGGC AACACCGGCA GCAACAACAT CGGGTTTGGC 420 

AACAC CGGCA GCAACAACAT CGGGTTCGGC AATAC CGGAG ACGGCAACCG AGGTATCGGG 48 0 

CTCACGGGTA GCGGTTTGTT GGGGTTCGGC GGCCTGAACT CGGGCACCGG CAACAT CGGT 540 

CTGTTCAACT CGGGCACCGG AAACGTCGGC ATCGGCAACT CGGGTACCGG GAACTGGGGC 500 

ATTGGCAACT CGGGCAACAG CTACAACACC GGTTT TGGC A ACTCCGGCGA CGCCAACACG 660 

GGCTTCTTCA ACTCCGGAAT AGCCAACACC GGCGTCGGCA ACGCCGGCAA CTACAACACC 72 0 

GGTAGCTACA ACCCGGGCAA CAGCAATACC GGCGGCTTCA ACATGGGCCA GTACAACACG 780 

GGCTACCTGA ACAGCGGCAA CTACAACACC GGCTTGGCAA ACTCCGGCAA TGTCAACACC 840 

GGCGCCTTCA TTACTGGCAA CTTCAACAAC GGCTTCTTGT GGCGCGGCGA CCACCAAGGC 900 

CTGATTTTCG GGAGCCCCGG CTTCTTCAAC TCGACCAGTG CGCCGTCGTC GGGATTCTTC 960 

AACAGC GGTG CCGGTAGCGC GTCCGGCTTC CTGAACTCCG GTGCCAACAA TTCTGGCTTC 1020 

TTCAACTCTT CGTCGGGGGC CATCGGTAAC TCCGGCCTGG CAAACGCGGG CGTGCTGGTA 1080 

TCGGGCGTGA TCAACTCGGG CAACACCGTA TCGGGTTTGT TCAACATGAG CCTGGTGGCC 1140 

ATCACAACGC CGGCCTTGAT CTCGGGCTTC TTCAACACCG GAAGCAACAT GTCGGGATTT 1200 

TTCGGTGGCC CACCGGTCTT CAATCTCGGC CTGGCAAACC GGGGCGTCGT GAACATTCTC 1260 

GGCAACGCCA ACATCGGCAA TTACAACATT CTCGGCAGCG GAAACGTCGG TGACTTCAAC 132 0 

ATCCTTGGCA GCGGCAACCT CGGCAGCCAA AACATCTTGG GCAGC GGCAA CGTCGGCAGC 13 80 

TTCAATATCG GCAGTGGAAA CATCGGAGTA TTCAATGTCG GTTCCGGAAG CCTGGGAAAC 1440 

TACAACATCG GATCCGGAAA CCTCGGGATC TACAACATCG GTTTTGGAAA CGTCGGCGAC 15 00 

TACAACGTCG GCTTCGGGAA CGCGGGCGAC TTCAACCAAG GCTTTGC CAA CACCGGCAAC 1560 

AACAACATCG GGTTCGCCAA CACCGGCAAC AACAACATCG GCATCGGGCT GTC CGGCGAC 1620 
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AACCAGCAGG 


GCTTCAATAT 


TGCTAGCGGC 


TGGAACTCGG 


GCAC CGG CAA 


CAG C GGC CTG 




TTCAATTCGG 


GCACCAATAA 


CGTTGGCATC 


TTCAACGCGG 


GCAC CGGAAA 


CGTCGGCATC 




GCAAACTCGG 


GCAC CGGGAA 


CTGGGGTATC 


GGGAACCCGG 


GTAC CGACAA 


1 At- L. Lj^L-tt. 1 \- 




CTCAATGCTG 


GCAGCTACAA 


CACGGGCATC 


CTCAACGCCG 


GCGACTTCAA 


CACGGGCTTC 




TACAACACGG 


GCAGCTACAA 


CACCGGCGGC 


TTCAACGTCG 


GTAACACCAA 


CACCGGCAAC 




TTCAACGTGG 


GTGAC AC CAA 


TACCGGCAGC 


TATAACCCGG 


GTGACACCAA 


CACCGGCTTC 




TTCAATCCCG 


GCAACGTCAA 


TACCGGCGCT 


TTCGACACGG 


GCGACTTCAA 


CAATGGCTTC 




TTGGTGGCGG 


GCGATAACCA 


GGGCCAGATT 


GCCATCGATC 


TCTCGGTCAC 


CACTCCATTC 




ATCCCCATAA 


ACGAGCAGAT 


GGTCATTGAC 


GTACACAACG 


TAATGACCTT 


CGGCGGCAAC 




ATGATCACGG 


TCACCGAGGC 


CTCGACCGTT 


TTCCCCCAAA 


CCTTCTATCT 


GAGCGGTTTG 




TTCTTCTTCG 


GCCCGGTCAA 


TCT CAGCGCA 


TCCACGCTGA 


CCGTTCCGAC 


GATCACCCTC 


2280 


ACCATCGGCG 


GAC CGACGGT 


GACCGTCCCC 


ATCAGCATTG 


TCGGTGCTCT 


GGAGAGCCGC 




ACGATTACCT 


TCCTCAAGAT 


CGATCCGGCG 


CCGGGCATCG 


GAAATTCGAC 


CACCAACCCC 




TCGTCCGGCT 


TCTTCAACTC 


GGGCACCGGT 


GGCACATCTG 


GCTTCCAAAA 


CGTCGGCGGC 




GGCAGTTCAG 


GCGTCTGGAA 


CAGTGGTTTG 


AGCAGCGCGA 


TAGGGAATTC 


GGGTTTCCAG 




AACCTCGGCT 


CGCTGCAGTC 


AGGCTGGGCG 


AACCTGGGCA ACTCCGTATC 


GGGCTTTTTC 




AACACCAGTA 


CGGTGAACCT 


CTCCACGCCG 


GCCAATGTCT 


CGGGCCTGAA 


CAACATCGGC 


2640 


ACCAACCTGT 


C CGG CGTGTT 


CCGCGGTCCG 


ACCGGGACGA 


TTTTCAACGC 


GGGCCTTGCC 


2700 


AACCTGGGCC 


AGTTGAACAT 


CGGCAGCGCC 


TCGTGCCGAA 


TTCGGCACGA 


GTTAGATACG 


2760 


GTTTCAACAA 


TCATATCCGC 


GTTTTGCGGC 


AGTGCAT CAG 


ACGAATCGAA 


CCCGGGAAGC 


2820 


GTAAGCGAAT 


AAACCGAATG 


GCGGCCTGTC 


AT 






2852 



(2) INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

Gly Gin Asn Ala Pro Ala lie Ala Ala Thr Glu Ala Ala Tyr Asp Gin 
15 10 15 
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Met Trp Ala Gin Asp Val Ala Ala Met Phe Gly Tyr His Ala Gly Ala 
20 25 30 

Ser Ala Ala Val Ser Ala Leu Thr Pro Phe Gly Gin Ala Leu Pro Thr 
35 40 45 

Val Ala Gly Gly Gly Ala Leu Val Ser Ala Ala Ala Ala Gin Val Thr 
50 55 60 

Thr Arg Val Phe Arg Asn Leu Gly Leu Ala Asn Val Arg Glu Gly Asn 
65 70 75 80 

Val Arg Asn Gly Asn Val Arg Asn Phe Asn Leu Gly Ser Ala Asn lie 



Gly Asn Gly Asn lie Gly Ser Gly Asn lie Gly Ser Ser Asn He Gly 
100 105 110 

Phe Gly Asn Val Gly Pro Gly Leu Thr Ala Ala Leu Asn Asn He Gly 
115 120 125 

Phe Gly Asn Thr Gly Ser Asn Asn He Gly Phe Gly Asn Thr Gly Ser 
130 135 140 

Asn Asn He Gly Phe Gly Asn Thr Gly Asp Gly Asn Arg Gly He Gly 
145 150 155 ISO 

Leu Thr Gly Ser Gly Leu Leu Gly Phe Gly Gly Leu Asn Ser Gly Thr 
165 170 175 

Gly Asn He Gly Leu Phe Asn Ser Gly Thr Gly Asn Val Gly He Gly 
180 185 190 

Asn Ser Gly Thr Gly Asn Trp Gly He Gly Asn Ser Gly Asn Ser Tyr 
195 200 205 

Asn Thr Gly Phe Gly Asn Ser Gly Asp Ala Asn Thr Gly Phe Phe Asn 
210 215 220 

Ser Gly He Ala Asn Thr Gly Val Gly Asn Ala Gly Asn Tyr Asn Thr 

225 230 235 240 

Gly Ser Tyr Asn Pro Gly Asn Ser Asn Thr Gly Gly Phe Asn Met Gly 
245 250 255 

Gin Tyr Asn Thr Gly Tyr Leu Asn Ser Gly Asn Tyr Asn Thr Gly Leu 
260 265 270 

Ala Asn Ser Gly Asn Val Asn Thr Gly Ala Phe He Thr Gly Asn Phe 
275 280 285 

Asn Asn Gly Phe Leu Trp Arg Gly Asp His Gin Gly Leu He Phe Gly 
290 295 300 

Ser Pro Gly Phe Phe Asn Ser Thr Ser Ala Pro Ser Ser Gly Phe Phe 
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Asn Ser Gly Ala Gly Ser Ala Ser Gly Phe Leu Asn Ser Gly Ala Asn 
325 330 335 

Asn Ser Gly Phe Phe Asn Ser Ser Ser Gly Ala Ile-Gly Asn Ser Gly 
340 345 350 

Leu Ala Asn Ala Gly Val Leu Val Ser Gly Val lie Asn Ser Gly Asn 
355 360 365 

Thr Val Ser Gly Leu Phe Asn Met Ser Leu Val Ala lie Thr Thr Pro 
370 375 380 

Ala Leu lie Ser Gly Phe Phe Asn Thr Gly Ser Asn Met Ser Gly Phe 
385 390 395 400 

Phe Gly Gly Pro Pro Val Phe Asn Leu Gly Leu Ala Asn Arg Gly Val 
405 410 415 

Val Asn lie Leu Gly Asn Ala Asn lie Gly Asn Tyr Asn He Leu Gly 
420 425 430 

Ser Gly Asn Val Gly Asp Phe Asn He Leu Gly Ser Gly Asn Leu Gly 
435 440 445 

Ser Gin Asn He Leu Gly Ser Gly Asn Val Gly Ser Phe Asn He Gly 
450 455 460 

Ser Gly Asn He Gly Val Phe Asn Val Gly Ser Gly Ser Leu Gly Asn 
4S5 470 475 480 

Tyr Asn He Gly Ser Gly Asn Leu Gly He Tyr Asn He Gly Phe Gly 
485 490 495 

Asn Val Gly Asp Tyr Asn Val Gly Phe Gly Asn Ala Gly Asp Phe Asn 
500 505 510 

Gin Gly Phe Ala Asn Thr Gly Asn Asn Asn He Gly Phe Ala Asn Thr 
515 520 525 

Gly Asn Asn Asn He Gly He Gly Leu Ser Gly Asp Asn Gin Gin Gly 
530 535 540 

Phe Asn He Ala Ser Gly Trp Asn Ser Gly Thr Gly Asn Ser Gly Leu 
545 550 555 560 

Phe Asn Ser Gly Thr Asn Asn Val Gly He Phe Asn Ala Gly Thr Gly 
565 570 575 

Asn Val Gly He Ala Asn Ser Gly Thr Gly Asn Trp Gly He Gly Asn 
580 585 590 



Pro Gly Thr Asp Asn Thr Gly He Leu Asn Ala Gly Ser Tyr Asn Thr 
595 600 605 
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Gly lie Leu Asn Ala Gly Asp Phe Asn Thr Gly Phe Tyr Asn Thr Gly 
610 615 620 

Ser Tyr Asn Thr Gly Gly Phe Asn Val Gly Asn Thr Asn Thr Gly Asn 
625 630 635 - 640 

Phe Asn Val Gly Asp Thr Asn Thr Gly Ser Tyr Asn Pro Gly Asp Thr 
645 650 655 

Asn Thr Gly Phe Phe Asn Pro Gly Asn Val Asn Thr Gly Ala Phe Asp 
660 665 670 

Thr Gly Asp Phe Asn Asn Gly Phe Leu Val Ala Gly Asp Asn Gin Gly 
675 680 685 

Gin lie Ala He Asp Leu Ser Val Thr Thr Pro Phe lie Pro He Asn 
690 695 700 

Glu Gin Met Val He Asp Val His Asn Val Met Thr Phe Gly Gly Asn 
705 710 715 720 

Met He Thr Val Thr Glu Ala Ser Thr Val Phe Pro Gin Thr Phe Tyr 

725 730 735 

Leu Ser Gly Leu Phe Phe Phe Gly Pro Val Asn Leu Ser Ala Ser Thr 
740 745 750 

Leu Thr Val Pro Thr He Thr Leu Thr He Gly Gly Pro Thr Val Thr 
755 760 765 

Val Pro He Ser He Val Gly Ala Leu Glu Ser Arg Thr He Thr Phe 
770 775 780 

Leu Lys He Asp Pro Ala Pro Gly He Gly Asn Ser Thr Thr Asn Pro 
785 790 795 800 

Ser Ser Gly Phe Phe Asn Ser Gly Thr Gly Gly Thr Ser Gly Phe Gin 
805 810 815 

Asn Val Gly Gly Gly Ser Ser Gly Val Trp Asn Ser Gly Leu Ser Ser 
820 825 830 

Ala He Gly Asn Ser Gly Phe Gin Asn Leu Gly Ser Leu Gin Ser Gly 
835 840 845 

Trp Ala Asn Leu Gly Asn Ser Val Ser Gly Phe Phe Asn Thr Ser Thr 
850 855 860 

Val Asn Leu Ser Thr Pro Ala Asn Val Ser Gly Leu Asn Asn He Gly 
865 870 875 880 

Thr Asn Leu Ser Gly Val Phe Arg Gly Pro Thr Gly Thr He Phe Asn 
885 890 895 
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Ala Gly Leu Ala Asn Leu Gly Gin Leu Asn lie Gly Ser Ala Ser Cys 
900 905 910 

Arg lie Arg His Glu Leu Asp Thr Val Ser Thr lie lie Ser Ala Phe 
915 920 925 

Cys Gly Ser Ala Ser Asp Glu Ser Asn Pro Gly Ser Val Ser Glu 
930 935 940 

(2) INFORMATION FOR SEQ ID NO -.205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:205: 
GGATC CATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 
(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 
CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 
(2) INFORMATION FOR SEQ ID NO: 2 08: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 



CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 
(2) INFORMATION FOR SEQ ID NO: 20 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 
GGATATCTGC AGAATTCAGG TTTAAAGCCC ATTTGCGA 
(2) INFORMATION FOR SEQ ID NO: 2 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

CCGCATGCGA GCCACGTGCC CACAACGGCC 

(2) INFORMATION FOR SEQ ID NO: 2 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 12 
CTTCATGGAA TTCTCAGGCC GGTAAGGTCC GCTGCGG 
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7676 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

TGGCGAATGG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG 60 

CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC 120 

CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG 180 

GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC 240 

ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTC CACGTT 300 

CTTTAATAGT GGACTCTTGT TCCAAACTGG AA.CAAC ACT C AACCCTATCT CGGTCTATTC 360 

TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA 420 

ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT 480 

TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 540 

TCCGCTCATG AATTAATTCT TAGAAAAACT CATCGAGCAT CAAATGAAAC TGCAATTTAT 6 00 

TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGCCG TTTCTGTAAT GAAGGAGAAA 660 

ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA TCGGTCTGCG ATTCCGACTC 720 

GTCCAACATC AATACAACCT ATTAAT TTC C CCTCGTCAAA AATAAGGTTA TCAAGTGAGA 780 

AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA AAGTTTATGC ATTTCTTTCC 84 0 

AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA ATCACTCGCA TCAACCAAAC 900 

CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC GCGATCGCTG TTAAAAGGAC 960 

AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC TGCCAGCGCA TCAACAATAT 1020 

TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC TGTTTTCCCG GGGATCGCAG 1080 

TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG CTTGATGGTC GGAAGAGGCA 1140 

TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT AACATCATTG GCAACGCTAC 1200 

CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT CCCATACAAT CGATAGATTG 1260 

TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA CCCATATAAA TCAGCATCCA 1320 

TGTTGGAATT TAATCGCGGC CTAGAGCAAG ACGTTTCCCG TTGAATATGG CTCATAACAC 1380 

CCCTTGTATT ACTGTTTATG TAAGCAGACA GTTTTATTGT TCATGACCAA AATCCCTTAA 144 0 

CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AG AT CAAAGG ATCTTCTTGA 1500 
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GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG X560 

GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 1620 

AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1680 

AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 1740 

AGTGGCGATA AGTCGTGTCT TAC CGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 18 00 

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 186 0 

ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1920 

AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 1980 

CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 204 0 

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 210 0 

GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 216 0 

TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT GAGCTGATAC CGCTCGCCGC 2220 

AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG CGGAAGAGCG CCTGATGCGG 228 0 

TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA TATATGGTGC ACTCTCAGTA 2340 

CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGTATACACT CCGCTATCGC TACGTGACTG 2400 

GGTCATGGCT GCGCCCCGAC ACCCGCCAAC ACCCGCTGAC GCGCCCTGAC GGGCTTGTCT 2460 

GCTCCCGGCA TCCGCTTACA GACAAGC TGT GACCGTCTCC GGGAGCTGCA TGTGTCAGAG 252 0 

GTTTTCACCG TCATCACCGA AACGCGCGAG GCAGCTGCGG TAAAGCTCAT CAGCGTGGTC 2580 

GTGAAGCGAT TCACAGATGT CTGC CTGTTC ATCCGCGTCC AGCTCGTTGA GTTTCTCCAG 2640 

AAGCGTTAAT GTCTGGCTTC TGATAAAGCG GGCCATGTTA AGGGCGGTTT TTTCCTGTTT 2700 

GGTCACTGAT GCCTCCGTGT AAGGGGGATT TCTGTTCATG GGGGTAATGA TACCGATGAA 2760 

ACGAGAGAGG ATGCTCACGA TACGGGTTAC TGATGATGAA CATGCCCGGT TACTGGAACG 2820 

TTGTGAGGGT AAACAACTGG CGGTATGGAT GCGGCGGGAC CAGAGAAAAA TCACTCAGGG 288 0 

TCAATGCCAG CGCTTCGTTA ATACAGATGT AGGTGTTCCA CAGGGTAGCC AGCAGCATCC 294 0 

TGCGATGCAG ATCCGGAACA TAATGGTGCA GGGCGCTGAC TTCCGCGTTT CCAGACTTTA 3000 

CGAAACACGG AAACCGAAGA CCATTCATGT TGTTGCTCAG GTCGCAGACG TTTTGCAGCA 3060 

GCAGTCGCTT CACGTTCGCT CGCGTATCGG TGATTCATTC TGCTAACCAG TAAGGCAACC 312 0 

CCGCCAGCCT AGCCGGGTCC TCAACGACAG GAGCACGATC ATGCGCACCC GTGGGGCCGC 3180 
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CATGCCGGCG ATAATGGCCT GCTTCTCGCC GAAACGTTTG GTGGCGGGAC CAGTGACGAA 3240 

GGCTTGAGCG AGGGCGTGCA AGATTCCGAA TACCGCAAGC GACAGGCCGA TCATCGTCGC 33 00 

GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT GACCCAGAGC GCTGCCGGGA CCTGTCCTAC 3360 

GAGTTGCATG ATAAAGAAGA CAGTCATAAG TGCGGCGACG ATAGTCATGC CCCGCGCCCA 3420 

CCGGAAGGAG CTGACTGGGT TGAAGGCTCT CAAGGGCATC GGTCGAGATC CCGGTGCCTA 3480 

ATGAGTGAGC TAACTTACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 3 540 

CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 3600 

TGGGCGCCAG GGTGGTTTTT CTTTTCACCA GTGAGACGGG CAACAGCTGA TTGCCCTTCA 3660 

CCGCCTGGCC CTGAGAGAGT TGCAGCAAGC GGTCCACGCT GGTTTGCCCC AGCAGGCGAA 3720 

AATCCTGTTT GATGGTGGTT AACGGCGGGA TATAACATGA GCTGTCTTCG GTATCGTCGT 3 78 0 

ATCCCACTAC CGAGATATCC GCACCAACGC GCAGCCCGGA CTCGGTAATG GCGCGCATTG 3 840 

CGCCCAGCGC CATCTG AT C G TTGGCAACCA GCATCGCAGT GGGAACGATG CCCTCATTCA 3 900 

GCATTTGCAT GGTTTGTTGA AAACCGGACA TGGCACTCCA GTCGCCTTCC CGTTCCGCTA 3 960 

TCGGCTGAAT TTGATTGCGA GTGAGATATT TATGCCAGCC AGCCAGACGC AGACGCGCCG 4 020 

AGACAGAACT TAATGGGCCC GCTAACAGCG CGATTTGCTG GTGACCCAAT GCGACCAGAT 4080 

GCTCCACGCC CAGTCGCGTA CCGTCTTCAT GGGAGAAAAT AATACTGTTG ATGGGTGTCT 4140 

GGTCAGAGAC ATCAAGAAAT AACGCCGGAA CATTAGTGCA GGCAGCTTCC ACAGCAATGG 4200 

CATCCTGGTC ATCCAGCGGA TAGTTAATGA TCAGCCCACT GACGCGTTGC GCGAGAAGAT 4260 

TGTGCACCGC CGCTTTACAG GCTTCGACGC CGCTTCGTTC TACCATCGAC ACCACCACGC 4320 

TGGCACCCAG TTGATCGGCG CGAGATTTAA TCGCCGCGAC AATTTGCGAC GGCGCGTGCA 43 8 0 

GGGCCAGACT GGAGGTGGCA ACGCCAATCA GCAACGACTG TTTGCCCGCC AGTTGTTGTG 444 0 

CCACGCGGTT GGGAATGTAA TTCAGCTCCG CCATCGCCGC TTCCACTTTT TCCCGCGTTT 4500 

TCGCAGAAAC GTGGCTGGCC TGGTTCACCA CGCGGGAAAC GGTCTGATAA GAGACACCGG 456 0 

CATACTCTGC GACATCGTAT AACGTTACTG GTTTCACATT CACCACCCTG AATTGACTCT 4620 

CTTCCGGGCG CTATCATGCC ATACCGCGAA AGGTTTTGCG CCATTCGATG GTGTCCGGGA 4680 

TCTCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG TAGGTTGAGG 4740 

CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC CAACAGTCCC 4800 



204 



CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG CCCGAAGTGG 4860 

CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC CGCAC CTGTG 4920 

GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC GATCCCGCGA 4980 

AATTAATACG ACTCACTATA GGGGAATTGT GAGCGGATAA CAATTCCCCT CTAGAAATAA 5040 

TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGGGCCAT CATCATCATC ATCACGTGAT 5100 

CGACATCATC GGGACCAGCC CCACATCCTG GGAACAGGCG GCGGCGGAGG CGGTCCAGCG 5160 

GGCGCGGGAT AGCGTCGATG ACATCCGCGT CGCTCGGGTC ATTGAGCAGG ACATGGCCGT 522 0 

GGACAGCGCC GGCAAGATCA CCTACCGCAT CAAGCTCGAA GTGTCGTTCA AGATGAGGCC 528 0 

GGCGCAACCG AGGGGCTCGA AACCACCGAG CGGTTCGCCT GAAACGGGCG CCGGCGCCGG 5340 

TACTGTCGCG ACTACCCCCG CGTCGTCGCC GGTGACGTTG GCGGAGACCG GTAGCACGCT 5400 

GCTCTACCCG CTGTTCAACC TGTGGGGTCC GGCCTTTCAC GAGAGGTATC CGAACGTCAC 5460 

GATCACCGCT CAGGGCACCG GTTCTGGTGC CGGGATCGCG CAGGCCGCCG CCGGGACGGT 552 0 

CAACATTGGG GCCTCCGACG CCTATCTGTC GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT 5 58 0 

GATGAACATC GCGCTAGCCA TCTCCGCTCA GCAGGTCAAC TACAACCTGC CCGGAGTGAG 564 0 

CGAGCACCTC AAGCTGAACG GAAAAGTCCT GGCGGCCATG TACCAGGGCA CCATCAAAAC 5700 

CTGGGACGAC CCGCAGATCG CTGCGCTCAA CCCCGGCGTG AACCTGCCCG GCACCGCGGT 576 0 

AGTTCCGCTG CACCGCTCCG ACGGGTCCGG TGACACCTTC TTGTTCACCC AGTAC CTGTC 582 0 

CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC GCCCGGCTTC GGCACCACCG TCGACTTCCC 5880 

GGCGGTGCCG GGTGCGCTGG GTGAGAACGG CAACGGCGGC ATGGTGACCG GTTGCGCCGA 594 0 

GACACCGGGC TGCGTGGCCT ATATCGGCAT CAGCTTCCTC GACCAGGCCA GTCAACGGGG 6000 

ACTCGGCGAG GCCCAACTAG GCAATAGCTC TGGCAATTTC TTGTTGCCCG ACGCGCAAAG 6060 

CATTCAGGCC GCGGCGGCTG GCTTCGCATC GAAAACCCCG GCGAACCAGG CGATTTCGAT 6120 

GATCGACGGG CCCGCCCCGG ACGGCTACCC GATCATCAAC TACGAGTACG CCATCGTCAA 618 0 

CAACCGGCAA AAGGACGCCG CCACCGCGCA GACCTTGCAG GCATTTCTGC ACTGGGCGAT 624 0 

CACCGACGGC AACAAGGCCT CGTTCCTCGA CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC 6300 

GGTGGTGAAG TTGTCTGACG CGTTGATCGC GACGATTTCC AGCGCTGAGA TGAAGACCGA 6360 

TGCCGCTACC CTCGCGCAGG AGGCAGGTAA TTTCGAGCGG ATCTCCGGCG ACCTGAAAAC 6420 

CCAGATCGAC CAGGTGGAGT CGACGGCAGG TTCGTTGCAG GGCCAGTGGC GCGGCGCGGC 6480 
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GGGGACGGCC GCCCAGGCCG CGGTGGTGCG CTTCCAAGAA GCAGCCAATA AGCAGAAGCA 6540 

GGAACTCGAC GAGATCTCGA CGAATATTCG TCAGGCCGGC GTCCAATACT CGAGGGCCGA 6600 

CGAGGAGCAG CAGCAGGCGC TGTCCTCGCA AATGGGCTTT GTGCCCACAA CGGCCGCCTC 6660 

GCCGCCGTCG ACCGCTGCAG CGCCACCCGC ACCGGCGACA CCTGTTGCCC CCCCACCACC 6720 

GGCCGCCGCC AACACGCCGA ATGCCCAGCC GGGCGATCCC AACGCAGCAC CTCCGCCGGC 6780 

CGACCCGAAC GCACCGCCGC CACCTGTCAT TGCCCCAAAC GCACCCCAAC CTGTCCGGAT 6840 

CGACAACCCG GTTGGAGGAT TCAGCTTCGC GCTGCCTGCT GGCTGGGTGG AGTCTGACGC 6900 

CGCCCACTTC GACTACGGTT CAGCACTCCT CAGCAAAACC ACCGGGGACC CGCCATTTCC 6960 

CGGACAGCCG CCGCCGGTGG CCAATGACAC CCGTATCGTG CTCGGCCGGC TAGACCAAAA 702 0 

GCTTTACGCC AGCGCCGAAG CCACCGACTC CAAGGC CGCG GCCCGGTTGG GCTCGGACAT 7080 

GGGTGAGTTC TATATGCCCT ACCCGGGCAC CCGGATCAAC CAGGAAACCG TCTCGCTTGA 714 0 

CGCCAACGGG GTGTCTGGAA GCGCGTCGTA TTACGAAGTC AAGTTCAGCG ATCCGAGTAA 72 00 

GCCGAACGGC CAGATCTGGA CGGGCGTAAT CGGCTCGCCC GCGGCGAACG CACCGGACGC 7260 

CGGGCCCCCT CAGCGCTGGT TTGTGGTATG GCTCGGGACC GCCAACAACC CGGTGGACAA 7320 

GGGCGCGGCC AAGGCGCTGG CCGAATCGAT CCGGCCTTTG GTCGCCCCGC CGCCGGCGCC 7380 

GGCACCGGCT CCTGCAGAGC CCGCTCCGGC GCCGGCGCCG GCCGGGGAAG TCGCTCCTAC 7440 

CCCGACGACA CCGACACCGC AGCGGACCTT ACCGGCCTGA GAATTCTGCA GATATCCATC 7500 

ACACTGGCGG CCGCTCGAGC ACCACCACCA CCACCACTGA GATCCGGCTG CTAACAAAGC 7560 

CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG 7620 

GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGAT 7676 
(2) INFORMATION FOR SEQ ID NO: 2 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: 

Met Gly His His His His His His Val He Asp He He Gly Thr Ser 
15 10 15 

Pro Thr Ser Trp Glu Gin Ala Ala Ala Glu Ala Val Gin Arg Ala Arg 
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20 25 30 

Asp Ser Val Asp Asp lie Arg Val Ala Arg Val lie Glu Gin Asp Met 
35 40 45 

Ala Val Asp Ser Ala Gly Lys lie Thr Tyr Arg Ile-Lys Leu Glu Val 

50 55 60 

Ser Phe Lys Met Arg Pro Ala Gin Pro Arg Gly Ser Lys Pro Pro Ser 



Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro 
85 90 95 

Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr 
100 105 110 

Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn 
115 120 125 

Val Thr lie Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly He Ala Gin 
130 135 140 

Ala Ala Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser 
145 150 155 160 

Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala 
165 170 175 

He Ser Ala Gin Gin Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His 
180 185 190 

Leu Lys Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gin Gly Thr He 
195 200 205 

Lys Thr Trp Asp Asp Pro Gin He Ala Ala Leu Asn Pro Gly Val Asn 
210 215 220 

Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly 
225 230 235 240 

Asp Thr Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro Glu Gly 
245 250 255 

Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val 
260 265 270 

Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys 
275 280 285 

Ala Glu Thr Pro Gly Cys Val Ala Tyr He Gly He Ser Phe Leu Asp 
290 295 300 

Gin Ala Ser Gin Arg Gly Leu Gly Glu Ala Gin Leu Gly Asn Ser Ser 
305 310 315 320 
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Gly Asn Phe Leu Leu Pro Asp Ala Gin Ser lie Gin Ala Ala Ala Ala 
325 330 335 

Gly Phe Ala Ser Lys Thr Pro Ala Asn Gin Ala lie Ser Met lie Asp 
340 345 - 350 

Gly Pro Ala Pro Asp Gly Tyr Pro lie lie Asn Tyr Glu Tyr Ala lie 
355 3S0 365 

Val Asn Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala 
370 375 380 

Phe Leu His Trp Ala He Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp 
385 390 395 400 

Gin Val His Phe Gin Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp 
405 .410 415 

Ala Leu He Ala Thr He Ser Ser Ala Glu Met Lys Thr Asp Ala Ala 
420 425 430 

Thr Leu Ala Gin Glu Ala Gly Asn Phe Glu Arg He Ser Gly Asp Leu 
435 440 445 

Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala Gly Ser Leu Gin Gly 
450 455 460 

Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin Ala Ala Val Val Arg 
465 470 475 480 

Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu Leu Asp Glu He Ser 
485 490 495 

Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg Ala Asp Glu Glu 
500 505 510 

Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe Val Pro Thr Thr Ala 
515 520 525 

Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro 
530 535 540 

Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gin Pro 
545 550 555 560 

Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro 
565 570 575 

Pro Pro Val lie Ala Pro Asn Ala Pro Gin Pro Val Arg He Asp Asn 
580 585 590 



Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser 
595 600 605 
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Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr 
610 615 620 

Gly Asp Pro Pro Phe Pro Gly Gin Pro Pro Pro Val Ala Asn Asp Thr 
625 630 635 640 

Arg lie Val Leu Gly Arg Leu Asp Gin Lys Leu Tyr Ala Ser Ala Glu 
645 650 655 

Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu 
660 665 670 

Phe Tyr Met Pro Tyr Pro Gly Thr Arg lie Asn Gin Glu Thr Val Ser 
675 680 685 

Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys 
690 695 700 

Phe Ser Asp Pro Ser Lys Pro Asn Gly Gin lie Trp Thr Gly Val lie 
705 710 715 720 

Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gin Arg Trp 
725 730 735 

Phe Val Val Trp Leu Gly Thr Ala Asn Asn Pro Val Asp Lys Gly Ala 
740 745 750 

Ala Lys Ala Leu Ala Glu Ser lie Arg Pro Leu Val Ala Pro Pro Pro 
755 760 765 

Ala Pro Ala Pro Ala Pro Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala 
770 775 780 

Gly Glu Val Ala Pro Thr Pro Thr Thr Pro Thr Pro Gin Arg Thr Leu 
785 790 795 800 

Pro Ala 

(2) INFORMATION FOR SEQ ID NO:215: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

GTGGCGGCGC TGCGGCCGGC CAGCAGAGCG ATGTGCATCC GTTCGCGAAC CTGATCGCGG 60 

TCGACGATGA GCGCGCCGAA CGCCGCGACG ACGAAGAACG TCAGGAAGCC GTCCAGCAGC 120 

GCGGTCCGCG CGGTGACGAA GCTGACCCCG TCGCAGATCA GCAGCACCCC GGCGATGGCG 180 

CCGACCAATG TCGACCGGCT GATCCGCCGC ACGATCCGCA CCACCAGCGC CACCAGGACC 240 

ACACCCAGCA GGGCGCCGGT GAACCGCCAG CCGAATCCGT TGTGACCGAA GATGGCCTCC 3 00 
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CCGATCGCGA TCAGCTGCTT ACCGACCGGC GGGTGAACCA CCAGGCCGTA CCCGGGGTTG 360 
TCTTCCACCC CATGGTTGTT CAGCACCTGC CAGGCCTGGC GGTGCGTAAT GCTTCTCGTC 420 
GAAGATGGGG GTGCCGGCAT CCGTCACCGA GCCC 454 

(2) INFORMATION FOR SEQ ID NO: 2 IS: 

( i ) S EQUENCE CHARACTER I STI CS : 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

TGCAGAAGTA CGGCGGATCC TCGGTGGCCG ACGCCGAACG GATTCGCCGC GTCGCCGAAC 60 

GCATCGTCGC CACCAAGAAG CAAGGCAATG ACGTCGTCGT CGTCGTCTCT GCCATGGGGG 120 

ATACCACCGA CGACCTGCTG GATCTGGCTC AGCAGGTGTG CCCGGCGCCG CCGCCTCGGG 180 

AGCTGGACAT GCTGCTTACC GCCGGTGAAC GCATCTCGAA TGCGTTGGTG GCCATGGCCA 240 

TCGAGTCGCT CGGCGCGCAT GCCCGGTCGT TCACCGGTTC GCAGGCCGGG GTGATCACCA 300 

CCGGCACCCA CGGCAACGCC AAGATCATCG ACGTCACGCC GGGGCGGCTG CAAACCGCCC 360 

TTGAGGAAGG GCGGGTCGTC TTGGTGGCCG GATTCCAAGG GGTCAGCCAG GACACCAAGG 420 

ATGTCACGAC GTTGGGCCGC GGCGGCTCGG ACACCACCGC CGTCGCCATG 470 

(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

GGCCGGCGTA CCCGGCCGGG ACAAACAACG ATCGATTGAT ATCGATGAGA GACGGAGGAA 60 

TCGTGGCCCT TCCCCAGTTG ACCGACGAGC AGCGCGCGGC CGCGTTGGAG AAGGCTGCTG 120 

CCGCACGTCG AGCGCGAGCA GAGCTCAAGG ATCGGCTCAA GCGTGGCGGC ACCAACCTCA 180 

CCCAGGTCCT CAAGGAC GCG GAGAGCGATG AAGTCTTGGG CAAAATGAAG GTGTCTGCGC 240 

TGCTTGAGGC CTTGCCAAAG GTGGGCAAGG TCCAGGCGC 279 

(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 
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ACACGGTCGA ACTCGACGAG CCCCTCGTGG AGGTGTCGAC CGACAAGGTC GACACCGAAA SO 

TCCCTCGCCG GCCGCGGGTG TGCTGACCAA GATCATCGCC CAAGAAGATG ACACGGTCGA 120 

GGTCGGCGGC GAGCTCTCTG TCATTGGCGA CGCCCATGAT GCCGGCGAGG CCGCGGTCCC 18 0 

GGCACCCCAG AAAGTCTCTG CCGGCCCAAC CCGAATCCA 219 

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 

TCGCTGCCGA CATCGGCGCC GCGCCCGCCC CCAAGCCCGC ACCCAAGCCC GTCCCCGAGC 60 

CAGCGCCGAC GCCGAAGGCC GAACCCGCAC CATCGCCGCC GGCGGCCCAG CCAGCCGGTG 120 

CGGCCGAGGG CGCACCGTAC GTGACGCCGC TGGTGCGAAA GCTGGCGTCG GAAAACAACA 18 0 

TCGACCTCGC CGGGGTGACC GGCACCGGAG TGGGTGGTCG CATCCGCAAA CAGGATGTGC 240 

TGGCCGCGGC TGAACAAAAG AAGCGGGCGA AAGCACCGGC GCCGGCCGCC CAGGCCGCCG 3 00 

CCGCGCCGGC CCCGAAAGCG CCGCCTGAAG ATCCGATGCC GC 342 

(2) INFORMATION FOR SEQ ID NO: 22 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

GGGTCTTGGT CAGTATCAGC GCCGACGAGG ACGCCACGGT GCCCGTCGGC GGCGAGTTGG 60 

CCCGGATCGG TGTCGCTGCC GACATCGGCG CCGCGCCCGC CCCCAAGCCC GCACCCAAGC 120 

CCGTCCCCGA GCCAGCGCCG ACGCCGAAGG CCGAACCCGC ACCATCGCCG CCGGCGGCCC 180 

AGC CAGCCGG TGCGGCCGAG GGCGCACCGT ACGTGACGCC GCTGGTGCGA AAGCTGGCGT 240 

CGGAAAACAA CATCGACCTC GCCGGGGTGA CCGGCACCGG AGTGGGTGGT CGCATCCGCA 3 00 

AACAGGATGT GCTGGC CGCG GCTGAACAAA AGAAGCGGGC GAAAGCACCG GCGCCCTGAG 3 60 

CGCTTCATCA CCCGGTTAAC CAGCTTGCCC CAGAAGCCGG CTTCGACCTC TTCGCGGGTC 420 

TTGGTCCGCT GCAGGCGGTC GGCGAGCCAG TTCAGGTTAG GCGGCCGAAA TCTTCCAGTT 480 

CGCCAGGAAG GGCACCCGGA ACAGGGTCCG CACCC 515 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CCGACCCCAA GGTGCAGATT CAACAGGCCA TTGAGGAAGC ACAGCGCACC CACCAAGCGC 60 

TGACTCAACA GGCGGCGCAA GTGATCGGTA ACCAGCGTCA ATTGGAGATG CGACTCAACC 120 

GACAGCTGGC GGACATCGAA AAGCTTCAGG TCAATGTGCG CCAAGCCCTG ACGCTGGCCG 180 

ACCAGGCCAC CGCCGCCGGA GACGCTGCCA AGGCCACCGA ATACAACAAC GCCGCCGAGG 240 

CGTTCGCAGC CCAGCTGGTG ACCGCCGAGC AGAGCGTCGA AGACCTCAAG ACGCTGCATG 300 

ACCAGGCGCT TAGCGCCGCA GCTCAGGCCA AGAAGGCCGT CGAACGAAAT GCGATGGTGC 360 

TGCAGCAGAA GATCGCCGAG CGAACCAAGC TGCTCAGCCA GCTCGAGCAG GCGAAGATGC 420 

AGGAGCAGGT CAGCGCATCG TTGCGGTCGA TGAGTGAGCT CGCCGCGCCA GGCAACACGC 480 

CGAGCCTCGA CGAGGTGCGC GACAAGATCG AGCGTCGCTA CGCCAACGCG ATCGGTTCGG 540 

CTGAACTTGC CGAGAGT 557 



(2) INFORMATION FOR SEQ ID NO: 222: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CAGGATAGGT TTCGACATCC ACCTGGGTTC CGCACCCGGT GCGCGACCGT GTGATAGGCC 60 

AGAGGTGGAC CTGCGCCGAC CGACGATCGA TCGAGGAGTC AACAGAAATG GCCTTCTCCG 12 0 

TCCAGATGCC GGCACTCGGT GAGAGCGTCA CCGAGGGGAC GGTTACCCGC TGGCTCAAAC 180 

AGGAAGGCGA CACGGTCGAA CTCGACGAGC CCCTCGTGGA GGT 223 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 



AAGAAGTACA TCTGCCGGTC GATGTCGGCG AACCACGGCA GCCAACCGGC GCAGTAGCCG 60 

ACCAGGACCA CCGCATAACG CCAGTCCCGG CGCACAAACA TACGCCACCC CGCGTATGCC 12 0 

AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 18 0 

CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 240 

AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 300 

GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTATT GCCAGAGCGA GCGCACGGCG 360 

TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 42 0 

GCGGTCTCGG ACGCGAACCA CGGAGCGTAG GTGGC CAGAT AGACCGCGAA CGGGATCAAC 48 0 

CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TTCCCAGCCA CGGTCTTTGC 54 0 

ACTTGGTATG AACGTCGCGC CGCCACGTCA ACGCCAGC 578 



(2) INFORMATION FOR SEQ ID NO: 224: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 



ACAACGATCG ATTGATATCG ATGAGAGACG GAGGAATCGT GGCCCTTCCC CAGTTGACCG 60 

ACGAGCAGCG CGCGGCCGCG TTGGAGAAGG CTGCTGCCGC ACGTCGAGCG CGAGCAGAGC 120 

TCAAGGATCG GCTCAAGCGT GGCGGCACCA ACCTCACCCA GGTCCTCAAG GACGCGGAGA 180 

GCGATGAAGT CTTGGGCAAA ATGAAGGTGT CTGCGCTGCT TGAGGCCTTG CCAAAGGTGG 240 

GCAAGGTCAA GGCGCAGGAG ATCATGACCG AGCTGGAAAT TGCGCCCCAC CCCGCCGCCT 3 00 

TCGTGGCCTC GGTGACCGTC AGCGCAAGGC CCTGCTGGAA AAGTTCGGCT CCGCCTAACC 360 

CCGCCGGCCG ACGATGCGGG CCGGAAGGCC TGTGGTGGGC GTACCCCCGC ATACGGGGGA 420 

GAAGCGGCCT GACAGGGCCA GCTCACAATT CAGGCCGAAC GCCCCGGTGG GGGGGAACCC 4 80 

GCCC 484 



(2) INFORMATION FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 



AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 60 

CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 120 

AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 180 

GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTAGT GCCAGAGCGA GCGCACGGCG 240 

TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 300 

GCGGTCTCGG ACGCGAACCA CGGAGC GTAG GTGGCCAGAT AGACCGCGAA CGGGATCAAC 360 

CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TCCCCAGCCA CGGTCTTTGC 420 

ACTTGGTACT GACGTCGCGC CGCCACGTCG AACGCCAGCG CCATCGCGCC GAAGAACAGC 480 

ACGAAGTACA CGCCGGACCA CTTGGTGGCG CAAGCCAATC CCAAGCAGCA CCCCGGC 53 7 



(2) INFORMATION FOR SEQ ID NO: 22 6: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 
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Gly 


Gly 


Ala 


Ala 


Ala 


Gly 


Gin 


Gin 


Ser 


Asp Val 


His Pro 


Phe 


Ala 


Asn 


1 








5 










10 






15 




Leu 


lie 


Ala 


Val 


Asp 


Asp 


Glu 


Arg 


Ala Glu Arg Arg Asp 


Asp 


Glu 


Glu 








20 










25 






30 






Arg 


Gin 


Glu 


Ala 


Val 


Gin 


Gin 


Arg 


Gly Pro Arg Gly Asp 


Glu 


Ala Asp 






35 










40 






45 








Pro 


Val 


Ala 


Asp 


Gin 


Gin 


His 


Pro 


Gly Asp Gly Ala Asp 


Gin 


Cys 


Arg 




50 










55 








60 








Pro 


Ala 


Asp 


Pro 


Pro 


His 


Asp 


Pro 


His 


His Gin Arg His 


Gin 


Asp 


His 


65 










70 








75 








80 


Thr 


Gin 


Gin 


Gly 


Ala 


Gly 


Glu 


Pro 


Pro 






Val 


Thr 


Glu 










85 










90 






95 




Asp 


Gly 


Leu 


Pro 


Asp 


Arg 


Asp 


Gin 


Leu 


Leu Thr 


Asp Arg 


Arg 


Val 


Asn 








100 










105 






110 






His 


Gin 


Ala 


Val 


Pro 


Gly 


Val 


Val 


Phe 


His Pro 


Met Val 


Val 


Gin 


His 






115 










120 






125 








Leu 


Pro 


Gly 


Leu 


Ala 


Val 


Arg 


















130 










135 
















(2) INFORMATION FOR SEQ ID NO: 227: 














(i) SEQUENCE CHARACTERISTICS: 
















(A) 


LENGTH: 


156 


amino acids 
















(B) 


TYPE : amino 


acid 


















(C) 


S TRANDEDNE S S : s ingl e 
















(D) 


TOPOLOGY: linear 
















(ii) MOLECULE 


TYPE : protein 














(> 


ci) SEQUENCE 


DESCRIPTION; 


: SEQ ID NO: 227: 








Gin 


Lys 


Tyr 


Gly 


Gly 


Ser 


Ser 


Val 


Ala 


Asp Ala 


Glu Arg 


He 


Arg 


Arg 


1 








5 










10 






15 




Val 


Ala 


Glu 


Arg 


He 


Val 


Ala 


Thr 


Lys 


Lys Gin 


Gly Asn 


Asp 


Val 


Val 








20 










25 






30 






Val 


Val 


Val 


Ser 


Ala 


Met 


Gly 


Asp 


Thr 


Thr Asp 


Asp Leu 


Leu 


Asp 


Leu 






35 










40 






45 








Ala 


Gin 


Gin 


Val 


Cys 


Pro 


Ala 


Pro 


Pro 


Pro Arg 


Glu Leu 


Asp 


Met 


Leu 




50 










55 








60 








Leu 


Thr 


Ala 


Gly 


Glu 


Arg 


He 


Ser 


Asn 


Ala Leu 


Val Ala 


Met 


Ala 


He 


65 










70 








75 








80 


Glu 


Ser 


Leu 


Gly 


Ala 


His 


Ala 


Arg 


Ser 


Phe Thr 


Gly Ser 


Gin 


Ala Gly 










85 










90 






95 




Val 


lie 


Thr 


Thr 


Gly 


Thr 


His 


Gly 


Asn 


Ala Lys 


He He 


Asp 


Val 


Thr 








100 










105 






110 






Pro 


Gly 


Arg 


Leu 


Gin 


Thr 


Ala 


Leu 


Glu 


Glu Gly 


Arg Val 


Val 


Leu 


Val 






115 










120 






125 








Ala 


Gly 


Phe 


Gin 


Gly 


Val 


Ser 


Gin 


Asp 


Thr Lys 


Asp Val 


Thr 


Thr 


Leu 




130 










135 








140 








Gly 


Arg 


Gly 


Gly 


Ser 


Asp 


Thr 


Thr 


Ala 


Val Ala 


Met 








145 










150 








155 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 228: 











(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH : 92 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 



i3 



Pro 


Ala 


Tyr 


Pro 


Ala 


Gly 


Thr 


Asn 


Asn 


Asp Arg 


Leu 


lie 


Ser 


Met 


1 








5 










10 








15 


Asp 


Gly Gly 


He 


Val 


Ala 


Leu 


Pro 


Gin 


Leu Thr Asp Glu Gin Arg 








20 










25 








30 




Ala 


Ala 


Leu 


Glu 


Lys 


Ala 


Ala 


Ala 


Ala 


Arg Arg 


Ala 


Arg 


Ala 


Glu 






35 










40 








45 






Lys 


Asp 


Arg 


Leu 


Lys 


Arg 


Gly 


Gly 


Thr 


Asn Leu 


Thr 


Gin 


Val 


Leu 




50 










55 








60 








Asp 


Ala 


Glu 


Ser 


Asp 


Glu 


Val 


Leu 


Gly 


Lys Met 


Lys 


Val 


Ser 


Ala 


65 










70 








75 










Leu 


Glu 


Ala 


Leu 


Pro 


Lys 


Val 


Gly 


Lys 


Val Gin 


Ala 









PC (2) INFORMATION FOR SEQ ID NO: 22 9: 

JC 

r% (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 72 amino acids 

I_ (B) TYPE: amino acid 

!rf (C) STRANDEDNESS: single 

IE (D) TOPOLOGY: linear 

W (ii) MOLECULE TYPE: protein 

fl 

fU (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 



Thr Val Glu Leu Asp Glu Pro Leu 

1 5 
Asp Thr Glu He Pro Ser Pro Ala 
20 

Ala Gin Glu Asp Asp Thr Val Glu 

35 40 
Gly Asp Ala His Asp Ala Gly Glu 

50 55 
Val Ser Ala Gly Pro Thr Arg He 
65 70 



Val Glu Val Ser Thr Asp Lys Val 

10 15 
Ala Gly Val Leu Thr Lys He He 
25 30 
Val Gly Gly Glu Leu Ser Val He 
45 

Ala Ala Val Pro Ala Pro Gin Lys 
60 



(2) INFORMATION FOR SEQ ID NO: 23 0: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



Ala 


Ala 


Asp 


He 


Gly 


Ala 


Ala 


Pro 


Ala 


Pro 


Lys Pro Ala Pro 


Lys 


Pro 


1 








5 










10 




15 




Val 


Pro 


Glu 


Pro 


Ala 


Pro 


Thr 


Pro 


Lys 


Ala 


Glu Pro Ala Pro 


Ser 


Pro 








20 










25 




30 






Pro 


Ala 


Ala 


Gin 


Pro 


Ala 


Gly 


Ala 


Ala 


Glu 


Gly Ala Pro Tyr 


Val 


Thr 






35 










40 






45 






Pro 


Leu 


Val 


Arg 


Lys 


Leu 


Ala 


Ser 


Glu 


Asn 


Asn He Asp Leu 


Ala Gly 




50 










55 








60 






Val 


Thr 


Gly 


Thr 


Gly 


Val 


Gly 


Gly 


Arg 


He 


Arg Lys Gin Asp 


Val 


Leu 


55 










70 










75 




80 


Ala 


Ala 


Ala 


Glu 


Gin 


Lys 


Lys 


Arg 


Ala 


Lys 


Ala Pro Ala Pro 


Ala 


Ala 










85 










90 




95 




Gin 


Ala 


Ala 


Ala 


Ala 


Pro 


Ala 


Pro 


Lys 


Ala 


Pro Pro Glu Asp 


Pro 


Met 



100 105 110 



Pro 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Val Leu Val Ser He Ser Ala Asp Glu Asp Ala Thr Val Pro Val Gly 

15 10 15 

Gly Glu Leu Ala Arg He Gly Val Ala Ala Asp He Gly Ala Ala Pro 

20 25 30 

Ala Pro Lys Pro Ala Pro Lys Pro Val Pro Glu Pro Ala Pro Thr Pro 

35 40 45 

Lys Ala Glu Pro Ala Pro Ser Pro Pro Ala Ala Gin Pro Ala Gly Ala 

50 55 60 

Ala Glu Gly Ala Pro Tyr Val Thr Pro Leu Val Arg Lys Leu Ala Ser 
65 70 75 80 

Glu Asn Asn He Asp Leu Ala Gly Val Thr Gly Thr Gly Val Gly Gly 

85 90 95 

Arg He Arg Lys Gin Asp Val Leu Ala Ala Ala Glu Gin Lys Lys Arg 

100 105 110 

Ala Lys Ala Pro Ala Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 2: 

Asp Pro Lys Val Gin He Gin Gin Ala He Glu Glu Ala Gin Arg Thr 

15 10 15 

His Gin Ala Leu Thr Gin Gin Ala Ala Gin Val He Gly Asn Gin Arg 

20 25 30 

Gin Leu Glu Met Arg Leu Asn Arg Gin Leu Ala Asp He Glu Lys Leu 

35 40 45 

Gin Val Asn Val Arg Gin Ala Leu Thr Leu Ala Asp Gin Ala Thr Ala 

50 55 60 

Ala Gly Asp Ala Ala Lys Ala Thr Glu Tyr Asn Asn Ala Ala Glu Ala 
65 70 75 80 

Phe Ala Ala Gin Leu Val Thr Ala Glu Gin Ser Val Glu Asp Leu Lys 

85 90 95 

Thr Leu His Asp Gin Ala Leu Ser Ala Ala Ala Gin Ala Lys Lys Ala 

100 105 HO 

Val Glu Arg Asn Ala Met Val Leu Gin Gin Lys He Ala Glu Arg Thr 

115 120 125 

Lys Leu Leu Ser Gin Leu Glu Gin Ala Lys Met Gin Glu Gin Val Ser 

130 135 140 

Ala Ser Leu Arg Ser Met Ser Glu Leu Ala Ala Pro Gly Asn Thr Pro 
145 150 155 ISO 

Ser Leu Asp Glu Val Arg Asp Lys He Glu Arg Arg Tyr Ala Asn Ala 

165 170 175 

He Gly Ser Ala Glu Leu Ala Glu Ser 
180 185 



(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

Val Ser Thr Ser Thr Trp Val Pro His Pro Val Arg Asp Arg Val He 

15 10 15 

Gly Gin Arg Trp Thr Cys Ala Asp Arg Arg Ser He Glu Glu Ser Thr 

20 25 30 

Glu Met Ala Phe Ser Val Gin Met Pro Ala Leu Gly Glu Ser Val Thr 

35 40 45 

Glu Gly Thr Val Thr Arg Trp Leu Lys Gin Glu Gly Asp Thr Val Glu 

50 55 60 

Leu Asp Glu Pro Leu Val Glu 
65 " 70 

(2) INFORMATION FOR SEQ ID NO: 23 4: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 



Glu 


Val 


His 


Leu 


Pro 


Val 


Asp 


Val 


Gly 


Glu 


Pro 


Arg 


Gin Pro Thr 


1 








5 










10 






15 


Ala 


Val 


Ala 


Asp 


Gin 


Asp 


His 


Arg 


He 


Thr 


Pro 


Val 


Pro Ala His 








20 










25 








30 


His 


Thr 


Pro 


Pro 


Arg 


Val 


Cys 


Gin 


Asp 


Trp 


His 


Arg 


Gin Pro Pro. 






35 










40 










45 


Arg 


Gly 


Arg 


Ala 


Asp 


Gin 


His 


Leu 


Gly 


Leu 


Asp 


Ala 


Arg Leu Cys 




50 










55 










60 




Ala 


Ala 


Cys 


Asn 


Val 


Leu 


Leu 


Val 


Asp 


Gly 


Val 


Gin 


His Arg Pro 


65 










70 










75 






Arg 


His 


Gly 


Pro 


Gly 


Pro 


Arg 


Phe 


Gly 


Phe 


Pro 


Arg 


Val Val Val 










85 










90 






95 


Cys 


Gly 


He 


Arg 


Gin 


Ala 


Arg 


Val 


Glu 


Val 


Glu 


Arg 


Phe Gly Gly 








100 










105 








110 


Leu 


Pro 


Glu 


Arg 


Ala 


His 


Gly 


Val 


Gly 


Gin 


Arg 


Asn 


Asn Arg Val 






115 










120 










125 


Thr 


Asp 


Arg 


Leu 


Thr 


Asp 


Arg 


Met 


Pro 


He 


Asp 


Arg 


Gly Leu Gly 




130 










135 










140 




Glu 


Pro 


Arg 


Ser 


Val 


Gly 


Gly 


Gin 


He 


Asp 


Arg 


Glu 


Arg Asp Gin 


145 










150 










155 






Gin 


Arg 


He 


Pro 


Ala 


Gly 


Lys 


His 


Val 


Thr 


Pro 


His 


Cys Ser Gin 










165 










170 






175 


Arg 


Ser 


Leu 


His 


Leu 


Val 

















180 

(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 



Asn Asp Arg Leu He Ser Met Arg 

1 5 
Gin Leu Thr Asp Glu Gin Arg Ala 
20 

Ala Arg Arg Ala Arg Ala Glu Leu 

35 40 
Thr Asn Leu Thr Gin Val Leu Lys 
50 55 



Asp Gly Gly He Val Ala Leu Pro 

10 15 
Ala Ala Leu Glu Lys Ala Ala Ala 
25 30 
Lys Asp Arg Leu Lys Arg Gly Gly 
45 

Asp Ala Glu Ser Asp Glu Val Leu 
60 
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Gly 


Lys 


Met 


Lys 


Val 


Ser 


Ala 


Leu 


Leu 


Glu 


Ala Leu Pro 


Lys Val Gly 


65 










70 










75 


80 


Lys 


Val 


Lys 


Ala 


Gin 


Glu 


He 


Met 


Thr 


Glu 


Leu Glu He 


Ala Pro His 










85 










90 






Pro 


Ala 


Ala 


Phe 


Val 


Ala 


Ser 


Val 


Thr 


Val 


Ser Ala Arg 


Pro Cys Trp 








100 


















Lys 


Ser 


Ser 


Ala 


Pro 


Pro 


Asn 


Pro 


Ala 


Gly 


Arg Arg Cys 


Gly Pro Glu 






115 










12 0 






125 




Gly 


Leu 


Trp 


Trp 


Ala 


Tyr 


Pro 


Arg 


He 


Arg 


Gly Arg Ser 


Gly Leu Thr 




130 










135 








140 




Gly 


Pro 


Ala 


His 


Asn 


Ser 


Gly 


Arg 


Thr 


Pro 


Arg Trp Gly 


Gly Thr Arg 


145 










150 










155 








(2 


INFORMATION FOI 


l SE( 


2 ID 


NO: 236: 






(i) SEQUENCE CHARACTERISTICS: 












(A) 


LENGTH : 


178 


amino acids 












(B) 


TYPE : amino 


acid 














(C) 


STRANDEDNESS : single 












(D) 


TOPOLOGY : linear 












(ii) MOLECULE 


TYPE: protein 










(xi) SEQUENCE 


DESCRIPTION 


SEQ ID 


NO : 2 3 6 : 




Asp 


Trp 


His 


Arg 


Gin 


Pro 


Pro 


His 


Arg 


Gly 


Arg Ala Asp 


Gin His Leu 


1 








5 










10 




15 


Gly 


Leu 


Asp 


Ala 


Arg 


Leu 


Cys 


Ala 


Ala 


Ala 


Cys Asn Val 


Leu Leu Val 








20 










25 






30 


Asp 


Gly 


Val 


Gin 


His 


Arg 


Pro 


Gin 


Arg 


His 


Gly Pro Gly 


Pro Arg Phe 






35 










40 






45 




Gly 


Phe 


Pro 


Arg 


Val 


Val 


Val 


Ala 


Cys 


Gly 


He Arg Gin 


Ala Arg Val 




50 










55 








60 




Glu 


Val 


Glu 


Arg 


Phe 


Gly 


Gly 


Val 


Val 


Pro 


Glu Arg Ala 


His Gly Val 


65 










70 










75 


80 


Gly 


Gin 


Arg 


Asn 


Asn 


Arg 


Val 


Ala 


Thr 


Asp 


Arg Leu Thr 


Asp Arg Met 










85 










90 




95 


Pro 


He 


Asp 


Arg 


Gly 


Leu 


Gly 


Arg 


Glu 


Pro 


Arg Ser Val 


Gly Gly Gin 








100 










105 






110 


He 


Asp 


Arg 


Glu 


Arg 


Asp 


Gin 


Pro 


Gin 


Arg 


He Pro Ala 


Gly Lys His 






115 










120 






125 




Val 


Thr 


Pro 


His 


Cys 


Pro 


Gin 


Pro 


Arg 


Ser 


Leu His Leu 


Val Leu Thr 




130 










135 








140 




Ser 


Arg 


Arg 


His 


Val 


Glu 


Arg 


Gin 


Arg 


His 


Arg Ala Glu 


Glu Gin His 


145 










150 










155 


160 


Glu 


Val 


His 


Ala 


Gly 


Pro 


Leu 


Gly 


Gly 


Ala 


Ser Gin Ser 


Gin Ala Ala 










165 










170 




175 



Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 

ATGCCAAGCC GGTGCTGATG CCCGAGCTCG GCGAATCGGT GACCGAGGGG ACCGTCATTC 60 

GTTGGCTGAA GAAGATCGGG GATTCGGTTC AGGTTGACGA GCCACTCGTG GAGGTGTCCA 120 

CCGACAAGGT GGACAC CGAG ATCCCGTCCC CGGTGGCTGG GGTCTTGGTC AGTATCAGCG 180 

CCGACGAGGA CGCCACGGTG CCCGTCGGCG GCGAGTTGGC CCGGATCGGT GTCGCTGCCG 240 

AGATCGGCGC CGCGCCCGCC CCCAAGCCCC C 271 

(2) INFORMATION FOR SEQ ID NO: 23 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 



Ala 


Lys 


Pro 


Val 


Leu 


Met 


Pro 


Glu 


Leu 


Gly 


Glu Ser Val Thr Glu Gly 


1 








5 










10 


15 


Thr 


Val 


He 


Arg 


Trp 


Leu 


Lys 


Lys 


He 


Gly 


Asp Ser Val Gin Val Asp 








20 










25 




30 


Glu 


Pro 


Leu 


Val 


Glu 


Val 


Ser 


Thr 


Asp 


Lys 


Val Asp Thr Glu He Pro 






35 










40 






45 


Ser 


Pro 


Val 


Ala 


Gly 


Val 


Leu 


Val 


Ser 


He 


Ser Ala Asp Glu Asp Ala 




50 










55 








60 


Thr 


Val 


Pro 


Val 


Gly 


Gly 


Glu 


Leu 


Ala 


Arg 


He Gly Val Ala Ala Glu 


65 










70 










75 80 


He 


Gly 


Ala 


Ala 


Pro 
85 


Ala 


Pro 


Lys 


Pro 







(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9: 

GAGGTAGCGG ATGGCCGGAG GAGCACCCCA GGACCGCGCC CGAACCGCGG GTGCCGGTCA 60 
TCGATATGTG GGCACCGTTC GTTCCGTCCG CCGAGGTCAT TGACGAT 107 

(2) INFORMATION FOR SEQ ID NO: 240: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

ATGAAGTTGA AGTTTGCTCG CCTGAGTACT GCGATACTGG GTTGTGCAGC GGCGCTTGTG 
TTTCCTGCCT CGGTTGCCAG CGCAGATCCA CCTGACCCGC ATCAGCCGGA CATGACGAAA 
GGCTATTGCC CGGGTGGCCG ATGGGGTTTT GGCGACTTGG CCGTGTGCGA CGGCGAGAAG 
TACCCCGACG GCTCGTTTTG GCACCAGTGG ATGCAAACGT GGTTTACCGG CCCACAGTTT 
TACTTCGATT GTGTCAGCGG CGGTGAGCCC CTCCCCGGCC CGCCGCCACC GGGTGGTTGC 
GGTGGGGCAA TTCCGTCCGA GCAGCCCAAC GCTCCCTGA 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



o 



(xi) SEQUENCE DESCRIPTION 


SEQ ID 


NO: 241: 




Met Lys Leu Lys 


Phe Ala Arg 


Leu 


Ser 


Thr 


Ala He Leu Gly Cys 


Ala 


1 


5 






10 


15 




Ala Ala Leu Val 


Phe Pro Ala 


Ser 


Val 


Ala 


Ser Ala Asp Pro Pro 


Asp 


20 






25 




30 




Pro His Gin Pro 


Asp Met Thr 


Lys 


Gly 


Tyr 


Cys Pro Gly Gly Arg 


Trp 


35 




40 






45 




Gly Phe Gly Asp 


Leu Ala Val 


Cys 


Asp 


Gly Glu Lys Tyr Pro Asp Gly 


50 


55 








60 




Ser Phe Trp His 


Gin Trp Met 


Gin 


Thr 


Trp 


Phe Thr Gly Pro Gin 


Phe 


65 


70 








75 


80 


Tyr Phe Asp Cys 


Val Ser Gly 


Gly 


Glu 


Pro 


Leu Pro Gly Pro Pro 


Pro 


85 






90 


95 




Pro Gly Gly Cys 


Gly Gly Ala 


He 


Pro 


Ser 


Glu Gin Pro Asn Ala 


Pro 


100 






105 




110 





(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
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GTGACCACGG TGGGCCTGCC ACCAACCCGG GCAGCGGCAG CCGCGGCGGC GCCGGCGGCT 60 

CCGGCGGCAA CGGTGGCGCC GGGGGTAACG CCACCGGCTC AGGCGGCAAG GGCGGCGCCG 12 0 

GTGGCAATGG CGGTGATGGG AGCTTCGGCG CTACCAGCGG CCCCGCCTCC ATCGGGGTCA 18 0 

CGGGCGCCCC CGGCGGCAAC GGCGGCAAGG GCGGCGCCGG TGGCAGCAAC CCCAACGGCT 240 

CAGGTGGCGA CGGCGGCAAA GGCGGCAACG GCGGTGCCGG CGGCAACGGG GGCTCGATCG 3 00 

GCGCCAACAG CGGCATCGTC GGCGGTTCCG GTGGGGCCGG TGGCGCTGGC GGCGCCGGCG 360 

GAAACGGCAG C 371 

(2) INFORMATION FOR SEQ ID NO: 24 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

GTCCGGGTCC CACCACCGCG CCGGCGCGCC CCTAGCGGCC GGGCGCACCA GCCCCTTTTC 60 

TTGACTCGTT CAAGAAAAGG GCCTTCTGTT TGGTCGGCCA TGTTGGCATG ATCGTGACCC 120 

ATGGGCAACA TCGACGTCGA CATCTCGGCC AAGGTCTAGC TCCATGCGAA TCGCCGCCGC 180 

GGTGGT GAG C ATCGGTCTAG CCGTCATAGC AGGGTTCGCG GTACCTGTTG CCGACGCACA 240 

CCCGTCGGAG CCCGGGGTTG TGTCCTACGC GGTGCTCGGA AAGGGGTCGG TCGGCAACAT 3 00 

CGTCGGCGCC CCAATGGGGT GGGAGGCGGT GTTCACCAAG CCGTTCCAGG CGTTTTGGGT 360 

CGAACTACCG GCGTGCAACA ACTGGGTGGA CATCGGGCTG CCCGAGGTGT ACGACGATCC 420 

CGAC 424 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

GCGATGGCGG CCGCGGGTAC CACCGCCAAT GTGGAACGGT TTCCCAACCC CAACGATCCT 6 0 

TTGCATCTGG CGTCAATTGA CTTCAGCCCG GCCGATTTCG TCACCGAGGG CCACCGTCTA 12 0 

AGGG CGGATG CGATCCTACT GCGCCGTACC GACCGGCTGC CTTTCGCCGA GCCGCCGGAT 18 0 

TGGGACTTGG TGGAGTCGCA GTTGCGCACG ACCGTCACCG CCGACACGGT GCGCATCGAC 24 0 

GTCATCGCCG ACGATATGCG TCCCGAACTG GCGGCGGCGT CCAAACTCAC CGAATCGCTG 30 0 

CGGCTCTACG ATTCGTC 317 

(2) INFORMATION FOR SEQ ID NO: 24 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

TGGCGTATGC GCTTCGCAGC CGGTGC CGCG TCAACGCGCC GGAGGCAATC GCTTCGCTGC 6 0 

CGAGGAATGG TTCGATCACG ATCGCAGTGT GCCGTCGTGC ACCGACACCG CCGTCCAACG 12 0 

TGAACTGAGG GCGGAAAATC GGCCGAAATC TCGCCCTCAG TTCACGCTCG GCGCCTAACG 180 

GTTCTGGAAG TTGGGTGCGC GCTTCTCGGC GAACGCGCGC GGGCCTTCCT TGGCGTCGTC 24 0 

GGACAGGAAG ACCTTGATGC CGATCTGGGT GTCGATCTTG AACGCCTCGT TTTCGGGCAT 300 

GCACTCGGTC TCGCGGATGG ACCGCAAGAT GGCCTGCACG GCCAGGGGTC CGTTAGCCGA 360 

GATGGCGTCG GCAAGTTCTA GAACCTTGGT CAACGCCTGG CCGTCGGGCA CACGTGGCCG 420 

AT 422 

(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.246: 

GCGTGCCGCT GAACACCAGC CCGCGGCTGC CAGATCTCCC GGACTCGGTA GTGCCGCCGG 60 

TGGCGTCGTT GCTCTCCTGA CGGGGCGCGG CGACCATAAG GTCGCTAATG CCCAGGTAGC 120 

GGCCCAGGTG CATGGAGTCG ATGATGATGC GACTCTCCAG CTCGCCGACC GGGAGCTTGG 180 

CATCGGGCCT GATCAGC CAG GACGCGTAGG ACAAGTCGAT CGAATGCATA GTGGCCTCCA 240 

GAGTGGCCGT GCCACTTCCG GCGTGCTCCA CGGCAAATGC CTTGATTTCT AGCTCCGCGT 3 00 

AGTGTTC CCG CATCGCCTGC GGGATGAATG GGAAC CGCAG GATGGCGACA AACGGGTCTG 360 

ACCTCAGGTT TGCCGCTTTG CGCACAGTGG TCGACAGCCG GTACTCGGCA TAAATGCTGG 420 

CCCCGA 426 

(2) INFORMATION FOR SEQ ID NO: 24 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

AGACCGGCGA GGGTGTGGTC GCTGCCCGCG GCATTGTCGA TAATCTGCGC TGGGTCGACG 60 

CGCCGATCAA CTAGTGAGGC GCAACGCTAG GCTTTGGGAT ACCCACAGCT AAAAAGTTTA 120 

TCAAAGAAAC GAAGAAGGTT GCCATGAGCA CTGTTGCCGC CTACGCCGCC ATGTCGGCGA 180 

CCGAACCCCT GACCAAGACC ACGATCACCC GTCGCGACCC GGGCCCGCAC GACATGGCGA 240 

TCGACATCAA ATTCGCCGGA ATCTGTCGCT CGGACATCCA TACCGTCCAA ACCGAATGGG 300 

GGCAACCGAA TTTACCTGTG GTCCCTG 32 7 
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(2) INFORMATION FOR SEQ ID NO: 24 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Asp His Gly Gly Pro Ala Thr Asn Pro Gly Ser Gly Ser Arg Gly Gly 

1 " 5 10 15 

Ala Gly Gly Ser Gly Gly Asn Gly Gly Ala Gly Gly Asn Ala Thr Gly 

20 25 30 

Ser Gly Gly Lys Gly Gly Ala Gly Gly Asn Gly Gly Asp Gly Ser Phe 

35 40 45 

Gly Ala Thr Ser Gly Pro Ala Ser He Gly Val Thr Gly Ala Pro Gly 

50 55 60 

Gly Asn Gly Gly Lys Gly Gly Ala Gly Gly Ser Asn Pro Asn Gly Ser 
65 " 70 75 80 

Gly Gly Asp Gly Gly Lys Gly Gly Asn Gly Gly Ala Gly Gly Asn Gly 

85 90 95 

Gly Ser He Gly Ala Asn Ser Gly He Val Gly Gly Ser Gly Gly Ala 

100 105 HO 

Gly Gly Ala Gly Gly Ala Gly Gly Asn Gly Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

Met Ala Ala Ala Gly Thr Thr Ala Asn Val Glu Arg Phe Pro Asn Pro 

15 10 15 

Asn Asp Pro Leu His Leu Ala Ser He Asp Phe Ser Pro Ala Asp Phe 

20 25 30 

Val Thr Glu Gly His Arg Leu Arg Ala Asp Ala He Leu Leu Arg Arg 

35 " 40 45 

Thr Asp Arg Leu Pro Phe Ala Glu Pro Pro Asp Trp Asp Leu Val Glu 

50 55 60 

Ser Gin Leu Arg Thr Thr Val Thr Ala Asp Thr Val Arg He Asp Val 
65 70 75 80 

He Ala Asp Asp Met Arg Pro Glu Leu Ala Ala Ala Ser Lys Leu Thr 

85 90 95 

Glu Ser Leu Arg Leu Tyr Asp Ser 
100 
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(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Ala Tyr Ala Leu Arg Ser Arg Cys Arg Val Asn Ala Pro Glu Ala lie 

1 5 10 15 . 

Ala Ser Leu Pro Arg Asn Gly Ser lie Thr lie Ala Val Cys Arg Arg 

20 25 30 

Ala Pro Thr Pro Pro Ser Asn Val Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 

Val Pro Leu Asn Thr Ser Pro Arg Leu Pro Asp Leu Pro Asp Ser Val 

15 10 15 

Val Pro Pro Val Ala Ser Leu Leu Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 

Met Ser Thr Val Ala Ala Tyr Ala Ala Met Ser Ala Thr Glu Pro Leu 

15 10 15 

Thr Lys Thr Thr lie Thr Arg Arg Asp Pro Gly Pro His Asp Met Ala 

20 25 30 

lie Asp He Lys Phe Ala Gly He Cys Arg Ser Asp lie His Thr Val 

35 40 45 
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Gin Thr Glu Trp Gly Gin Pro Asn Leu Pro Val Val Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

GCTTGGAGCC CTGGAGCGAC GGTGTGGGTC TGGGGGTCGA TTCGTTCTCG GCGAAAGTCA 60 

ACTAAAGACC ACGTTGACAC CCAACCGGCG GCCCGGCATG GGCCGTCGCG GCGTAGAAGC 120 

TTTGACCGCG GCGC GAAACG TTCGCTGCTG CGGCCCATGC AGATCGCACA CGCTTGCTTG 180 

AACATCGGGT GGAGCCGGTG GTAACGCCAG GCT 213 

(2) INFORMATION FOR SEQ ID NO: 2 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:254: 

CCGAGCTGCT GTTCGGCGCC GGC GGTGCGG GCGGCGCGGG TGGGGCGGGC ACCGACGGCG 60 

GGCCCGGTGC TACCGGCGGG ACCGGCGGAC ACGGCGGAGT CGGCGGCGAC GGCGGATGGC 120 

TGGCACCCGG CGGGGCCGGC GGGGCCGGCG GGCAAGGCGG GGCAGGTGGT GCCCGCAGCG 180 

ATGGTGGCGC GTTGGGTGGT ACCGGCGGGA CGGGCGGTAC CGGCGGCGCC GGTGGCGCCG 240 

GCGGTCGCGG CACACT GCTG CTGGGCGCTG GCGGACAGGG CGGCCTCGGC GGCGCCGGCG 300 

GACAAGGCGG CACCGGCGGG GGCCGGCGGA GATGGCGTTC TGGGGGGTGT CAGTGGCACT 360 

GGTGGTA 367 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

AAGGCGTGAT TGGCAAGGCG ACCGCGCAGC GGCCCGTAGC CGCGGGACGG CCCAGGCCCC 60 
GACCGCAGCG GCCGGTGTCT GACCGGGTCA GCGACCAGCG GCGCTGACCG TGCCGCTCGT 120 
CTACTTCGAC GCCAGCGCCT TCGTCAAACT TCTCACCACC GAGACAGGGA GCTCGCTGGC 180 
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GTCCGCTCTA TGGGACGGCT GCGACGCCGC ATTGTCCAAC CGCCTGGCCT ACCCCGAAGT 240 

CCGCGCCGCA CTCGCTGCAA CGGGCCGCAA TCACGACCTA ACCGAATCCG AGCTCGCCGA 3 00 

CGCCGAGCGT GACTGGGAGG ACTTCTGGGC CGCACCCGCC CAGTCGAACT CACCGCGACG 360 

GTTGAACAGC ACGCCGGGCA CCTCGCCCGA ACACATGCCT TACGCGGAGC CGACACCGTT 420 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

CTCTTGTCGG TGGCATCGGC GGTACCGGCG GAACCGGCGG CAACGCCGGT ATGCTCGCCG 60 

GCGCCGCCGG GGCCGGCGGT GCCGGCGGGT TCAGCTTCAG CACTGCCGGT GGGGCTGGCG 120 

GCGCCGGCGG GGCCGGTGGG CTGTTCACCA CCGGCGGTGT CGGCGGCGCC GGTGGGCAGG 180 

GTCACACGGG CGGGGCGGGC GGCGCCGGCG GGGCCGGCGG GTTGTTTGGT GCCGGCGGCA 240 

TGGGCGGGGC GGGCGGATTC GGGGATCACG GAACGCTCGG C AC CGGCGGG GCCGGCGGG 299 

(2) INFORMATION FOR SEQ ID NO: 25 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

Leu Glu Pro Trp Ser Asp Gly Val Gly Leu Gly Val Asp Ser Phe Ser 

15 10 15 

Ala Lys Val Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

Glu Leu Leu Phe Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly 

15 10 15 

Thr Asp Gly Gly Pro Gly Ala Thr Gly Gly Thr Gly Gly His Gly Gly 
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Val Gly 


Gly 


Asp 


Gly 


Gly 


Trp 


Leu 


Ala 


Pro 


Gly Gly Ala Gly Gly Ala 


Gly Gly 


Gin 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Arg 


Ser Asp Gly Gly Ala Leu 






















Gly Gly 


Thr 


Gly 


Gly 


Thr 


Gly 


Gly 


Thr 


Gly 


Gly Ala Gly Gly Ala Gly 


65 








70 










75 80 


Gly Arg 


Gly 


Thr 


Leu 
85 


Leu 


Leu 


Gly 


Ala 


Gly 
90 


Gly Gin Gly Gly Leu Gly 
95 


Gly Ala 


Gly 


Gly 
100 


Gin 


Gly 


Gly 


Thr 


Gly 
105 


Gly 


Gly Arg Arg Arg Trp Arg 
110 


Ser Gly 


Gly 
115 


Cys 


Gin 


Trp 


His 


Trp 
120 


Trp 







(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 34 amino acids 
{B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 9: 



Gly Val He Gly Lys Ala Thr Ala Gin Arg Pro Val Ala Ala Gly Arg 

15 10 15 

Pro Arg Pro Arg Pro Gin Arg Pro Val Ser Asp Arg Val Ser Asp Gin 
20 25 30 

Arg Arg 



(2) INFORMATION FOR SEQ ID NO: 2 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

Leu Val Gly Gly He Gly Gly Thr Gly Gly Thr Gly Gly Asn Ala Gly 

15 10 15 

Met Leu Ala Gly Ala Ala Gly Ala Gly Gly Ala Gly Gly Phe Ser Phe 

20 25 30 

Ser Thr Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe 

35 40 45 

Thr Thr Gly Gly Val Gly Gly Ala Gly Gly Gin Gly His Thr Gly Gly 

50 55 60 

Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe Gly Ala Gly Gly Met 
65 70 75 80 
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Gly Gly Ala Gly Gly Phe Gly Asp His Gly Thr Leu Gly Thr Gly Gly 
85 90 95 

Ala Gly Gly 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

TCCTGTTCGG CGCCGGCGGG GTGGGCGGTG TTGGCGGTGA CGGTGTGGCA TTCCTGGGCA 6 0 

CCGCCCCCGG CGGGCCCGGT GGTGCCGGCG GGGCCGGTGG GCTGTTCAGC GTCGGTGGGG 120 

CCGGCGGCGC CGGCGGAATC GGATTGGTCG GGAACAGCGG TGCCGGGGGG TCCGGCGGGT 18 0 

CCGCCCTGCT CTGGGGCGAC GGCGGTGCCG GCGGCGCGGG TGGGGTCGGG TCCACTACCG 240 

GCGGTGCCGG CGGGGCGGGC GGCAACGCCA GCCTGCTGGT AA 282 

(2) INFORMATION FOR SEQ ID NO: 2 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CGGCACGAGC CGTGCTACTG GTCAACTGAT GCCCTGATTG TGACCTTCCC GGCGCCGGAT 60 

CAGTGCTTCT CAGGACCGAC GTAATATTCG AAAACCAATC CGGCCGCCGA GGCGAGGATG 120 

AATGCCACAC CGGCGGCGAT CAGCCACGGG AGCCACAACG CGATGCCGAC CGCTGCCACC 180 

GAGCCGGACA ACGCGACCAT GATCGGCCAC CAGCTATGCG GACT GAAGAA TCCAAGTTCT 240 

CCTGCGCCGT CGCTGATTTC AGCGCCTTCG TAGTCCTCGG GCCGGGAATC TAACCGGCGG 3 00 

GCCACAAACC GGAAGAAGGT GGCGACGATC AACGCCATGC CGCCGGTGAG CGCCAACGCA 360 

AT GGTGCCAG CCCACTCGAC ACCACCGGTG GCGAACATCG AGGTCAACAC GCCGT 415 

(2) INFORMATION FOR SEQ ID NO: 2 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 
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TCACCGCGTG AACGGTTCGT AACACTGATA CGTATGCTTG TCAGCGAGCA GATCAAGTCC 60 

AGTCCGACCA ATGCCAGGAG ATCATCGGCT AGGCTCACGG TTTCGCCTGG GACGAGACGG 120 

TATTGAGTTC TGGCGTTGGA CGGTCCGTGG CGTGGTGGGA AGTCTGACGC GGCATCAGAA 180 

CGGTTGTCAA TACCAGTCTT TGGGGGATAT GGCCTATTTG GTGTCGTCGG GCCGCTCCAC 240 

CGGATCCCTT TTCGAACGTT GCGCAAGCGC GGTCCAGTTA CGGCCTGTTC ACTGCGCGCT 300 

GGCGTAGCTG CGCGGCCTCG ATCGGTTTGA ACGTCATCGC AATTCCCGCA ATGGGTGAGT 360 

ACCTGACGCT CCT 373 

(2) INFORMATION FOR SEQ ID NO:264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 

CCAAACCGGA CAGGCCGGCA GCGACGGTCG GAAGTTGCAC CACGGTGCGC GCTCCATGTA 60 

GCCAACCGGT GACCACGGCG TAGACAGCAG ATC CGTGGAT CGCGCGTTCG GTGTCGTCCG 120 

GGCCGAGTAC CCGCGGGCCG AACCGCAGCG ACCAAAGCAA CGCGATCGAT ACGGGGATCG 180 

CCACTCGTGC CGAATTCGAG CTCCGTCGAC AAGCTTGCGG CCGCACTCGA ACCCGGGTGA 240 

ATGATTGAGT TTAAACCGCT TAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG 3 00 

TCTTGAGGGG TTTTTTGCTG AAAGGAGGAA CTATATCCGG ATAACCTGGC GTAGTAGCGA 360 

AGAGGC CCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGACGCG 420 

CCC 423 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 

AGTGGCCAGC CGGTCGGCCA ATGCATCCAG CTCCCGGTAC GTCAGCTGAC CATCCGCCCA 60 

ACTGACCGCC AC CGAGT C AG GCTGTGCCGC AGCGATTTCG GCGAACCGGG TATGCACCGC 120 

GGGTGCCGAC GTCGT CACAT CCGGCAGGCC GGGTGCGGTC GGATCGTGCT CGCCGTCCAG 180 

CAGAATGTCG ACGTCGCGCA GCGGCCGATC CCACCGGCTG ACCAAGCGCT GTAACACAGC 240 

CAGCACCCGC CTGCCGAGGC TTTCGGGCGC CATCGTGCCC AGCGCACCGT CGAGCACCTC 300 

CACTAGCAGC GTGAGCTCAC CGGTGCTGCG GTGCGCGGCG ACGGTCACCG GAAAGTGCGA 360 

CAAACTCTCT AGCGCCACCG GACGGAACGT CACCCCGTTT GCGA 404 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60 

ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120 

GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180 

GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 240 

ATGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 3 00 

GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360 

GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420 

C 421 

(2) INFORMATION FOR SEQ ID NO: 2 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.267: 

ATGCATATCA CGCTCAACGC CATCCTGCGT GCGATCTTCG GGGCCGGCGG CAGTGAACTA 60 

GACGAGCTGC GCCGCCTCAT TCCGCCGTGG GTCACGCTGG GCTCGCGCCT GGCGGCGCTA 120 

CCGAAACCCA AACGCGACTA TGGCCGCCTT AGCCCGTGGG GCCGGCTGGC CGAGTGGCGG 180 

CGCCAGTACG ACACTGTCAT CGACGAGCTC ATCGAAGCCG AGCGGGC CGA CCCGAACTTC 24 0 

GCCGATCGGA CCGACGTTTT GGCGTTGATG CTGCGCAGCA CTTACGACGA CGGTTCCATC 3 00 

ATGTCGCGCA AGGACATTGG CGACGAACTG CTCACGCTGC TTGCCGCCGG GCACGAAACC 360 

ACGGCGGCGA CATGGGCTGG GCGTTCGAAC GGCTCAACCG GCACCCCGAC GTGCTCGCGG 420 

CTCTGG 426 



(2) INFORMATION FOR SEQ ID NO: 2 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 



GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60 

ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120 

GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180 

GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 24 0 

GTGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 300 

GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360 

GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420 
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CGCCGCTCCG ACGGCAAGCT GGTGCTGGGC AGCGCAGATG GCGCCGTCTA CACGCTTGCC 480 
AAGAACCCGC AGTTGACCGG CGTCGGCGCC GCCACCGTAG CC 522 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO:269: 

ACCGCCGTCC GGCGGCCCCA GCCCCTGGGC CCAGACCCCG CGCAAAACCA 60 

CTTAGTGGCC GGCGCCGCCG CCGTCGTGCT CGTCCTCGTG TTGGGCGCCA 12 0 

GATCGCCATC CGGCCCAAGC CGGTACAGCC GCCTCAGCCG GTTGCGGAGG 180 

CGCCCTACTG CTGAACTCCT CAGAAGTCAA CGCCGTGATG GGCTCGTCGT 240 

GGGCAAACCG ATCACATCGA TGGACTCTTC GCCGGTGACG GTGTCCCTGC 300 

GGGCGCGCTG TATACCAGCC AGGATCCGGT GTATGCCGGC ACCGGCTACA 360 

CGGCTTGATT TCATCCGAGC CGGGCGACAA CTACGAACAT TGGGTGAACC 420 

CGCCTTTCCG ACCGCCGACA AAGCCCGCGC GTTCGTGCAG ACTTCGGCCG 480 

GAACTGCGCA GGCAAGACGG TCACCGTCAC GAATAAGGCC AAGACCTACC 540 

TGCCGACGTC AAAGGCAGCC CGCCGACGAT CACGGTGATA GACACCCAAG 600 

GGGCTGGGAA TGCCAACGCG CGATGAGCGT GGCCAACAAT GTGGTTGTCG 660 

ATGCGGGTAC CAGATCACCA ATCAAGCAGG CCAGATCGCC GCCAAGATCT 720 

TCAACAAGG 73 9 

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AGACGTCGTC GAGGCCGCCA TCGCCCGCGC CGAAGCCGTT AACCCGGCAC TGAACGCGTT 60 
GGCGTATGC 69 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ii) 
(xi) 

GCTGGGGCGC 
ACCCGTGGCC 
TCGGCATCTG 
AGCGCCTTAG 
CCATGCAGCC 
CGGACTGCCA 
CCGCCATCAA 
AAGCCGTCGT 
ACAAATGGAA 
GGTGGACGTT 
AAGGCGCTGA 
ACGTCAACGC 
GTTGACAAAG 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 
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ACTGCACCCG GCAGGCGCGA CCAACGGATC GGGTCAACTA GCACTGCCGG TGGAGGCGCC 60 

CCCGCGGTCT GTGCCTTCCC ACGGGGAACC CTTGGGCAGC GCGGCTCCAG AAGGGTTGGA 12 0 

GGGAGAGTTC GACGACCGTA TCGACGAGCG GTTCCCGGTC TTCAGCTCGG CCAGTCTCGC 180 

CGAAGCGCTG CCGGGTCCGC TGACCCCGAT GACGCTGGAT GTCCAGTTGA GTGGACTGCG 24 0 

CGCGGCCGGT CGGGCGATGG GTCGGGTACT GGCGCTTGGC GGTGTCGTTG CCGATGAGTG 300 

GGAGAGAAGA GCCATCGCGG TGTTCGGTCA CCGCCCGTAT ATCGGAGTGT CGGCCAATAT 360 

TGTGGCCGCC GCCCAACTGC CGGGGTGGGA CGCGCAGGCC GTAACCCGGC GGGCACTGGG 42 0 

CGAGCAACCG CAGGTCACTG AGCTGCTTCC GTTTGGTCGA CCGCAACTTG CGGGCGGACC 480 

GCTCGGCTCG GTCGCGAAGG TGGTCGTGAC GGCACGGTCG CTG 523 

(2) INFORMATION FOR SEQ ID NO: 2 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
, s (D) TOPOLOGY: linear 

fz (ii) MOLECULE TYPE : cDNA 

O 

4f (xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: 

£ 

GTGTCGGTGT CGTCGGGGTA GGAGCGACTT CCCCGGCCGG CGCCGGCGCC GGAGCGGGCT 6 0 

4=; CTGCAGGAAC CGGTGCCGGC GCCGGCGGCG GGGCGACCAA AGGCCGGATC GATTCGGCCA 12 0 

W GCGCCTTGGC CGCGCCCTTG TCCACCGGGT TGTTGGCGGT CCCGAGCCAT ACCACAAACC 18 0 

=_ AACGCTGAAG GGGCCCGGCG TCCGGTGCGT TCGCCGCGGG CGAC 224 

IjJ (2) INFORMATION FOR SEQ ID NO:273: 

Ul (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 521 base pairs 

jpy (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

TGAACTGACT GCCCCGCTCG ATCGGCGGCG GCGGCGTGTC ATAGCTGCGC CGCCAGGCCA 60 

TGAACTGCTC TTCGCCATAG CGGGCCTTGG TCTCGGCCTT GTCCAAACCC TGCAGCGCGC 12 0 

CGTAGTGGCG TTCGTTGAGC CGCCAGCTAC GCCGCACGGG AATCCAGAGC CGATCGGCGC 180 \ 

TGTCCAACGC CAGATGCGCG GTGGTGATCG CGCGCCGCAG CAACGAGGTG TAGAGCACGT 240 

CGGGCAATAG GTCGTGTTCC GCGATCAGCT CGCCGCTTCG AACCGCCTCT GCCTGGCCCT 300 

TGTCCGTCAG GCCGACATCG ACCCAGCCGG TGAACAGGTT GAGGGCATTC CAGTCGCTCT 36 0 

CGCCGTGGCG CAGCAACACC AGGCTGCCAG TGTTTGCCAT AC CGGCAAGT CTCTCACGCA 420 

CTCCCGCACT CCTCATCGTG GACCAAAATG CCCGAATTCT CCTCGGTCCG CTGCGCAGCG 480 

CGTTCATACC GCCGAGGTGG TCGGCACCGT AACGGCCGGT T 521 

(2) INFORMATION FOR SEQ ID NO: 274: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:274: 

CTCCAGGCTC ATTCGCTCGA ACAAAGCCAC CCGGCCGTAC AGCGGACGCC CCCATTCGTT 60 

GTCGTGATAG TCGCGGTACA GCTGGGCATC GGGCCCTGGA CGAACCTCCG CCCAGGGGCA 120 

GCGAACCAGC CCGTCGCCGC TCACGCGGGG TCAGAACGGT AGTGCACGAC AGTCTCGCCG 180 

CGCGAAGGGT TTGACGCGTC AGACTCGGCC TCGGCGTCTT CCGACGAGGC GTGGATCGCC 240 

CCGAGCTGAG AGCGTAGCGC CTCGAGCTCA CGGCCGAGCC GTTCCAGCAC CCAGTCCACC 300 

TCGCTGGTCT TGTTCCCGCG CAGCACCTGC GTGAAC TTGA CCGCGTCGAC ATCGGCGCGG 360 

GTGACCCCGA ACGCCGGCAG CGTCGTCGCC GTCGTCGCCC GCGGCAGGGG CGGCAACTGC 420 

TCGCCA 426 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 219 base pairs 
O (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:275: 

GCGGACACGG CGGACAAAGC GCAATCGGCC TCGGCGGCGG CGCCGGCGGC GACGGGGGCC 60 

AGGGCGGCGC CGGCCGCGGA CTGTGGGGTA CTGGCGGCGC CGGCGGACAC GGCGGGGCAA 120 

GGCGGTGGTA CCGGGGGCCC ACCGCTGCCC GGTCAGGCAG GCATGGGCGC CGCGGGTGGC 180 

GCCGGTGGGC TGATCGGCAA CGGCGGGGCC GGCGGCGAC 219 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

AAGATCATCG GCGCCGCTCC TTAGCATCGC TGCGCTCTGC ATCGTCGCCG GCGCGGATCA 60 

CGGAGGTCCG GCCTTGTACC CCACTCCTCG AACGGTCAGC ACCACAGTCG GGTTCTCGGG 120 

ATCCTTTTCG ACCTTGGCCC GCAGACGCTG GACATGCACG TTCACCAGCC TGGTATCGGC 180 

TGGGTGCCGG TAACCCCATA CCTGTTCGAG CAGCACATCA CGAGTAAACA CCTGGCGCGG 240 

CTTGCGCGCC AATGCGACCA ACAGGTCGAA TTCCAGCGGT GTCAACGAGA TCTGCTCACC 300 

GTTGCGAGTG ACCTTGTGCG CCGGTACGTC GATTTCTACG TCGGCGATGG ACAGCATCTC 36 0 

GGCGGGTTCG TCGTCGTTGC GGCGCAGCCG CGCCCGCACC CGCGCAACCA GCTCCTTGGG 42 0 

CTTGAACGGC TTCATGATGT AGTCGTCGGC GCCCGACTCC AGAC CCAGCA CCACATCCAC 480 

GGTGTCGGTC TTTGCGGTGA GCATCACGAT CGGAACACCG GAATCGGCGC GCAACACCCG 540 

GCACACGTCG ATGCCGTTCA TACCGGGGCA A 571 
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(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Leu Phe 


Gly Ala 


Gly 


Gly Val Gly 


Gly 


Val 


Gly 


Gly Asp Gly Val Ala 


1 




5 






10 




15 


Phe Leu 


Gly Thr 


Ala 


Pro Gly Gly 


Pro 


Gly 


Gly 


Ala Gly Gly Ala Gly 




20 






25 






30 


Gly Leu 


Phe Ser 


Val 


Gly Gly Ala 


Gly 


Gly 


Ala 


Gly Gly lie Gly Leu 




35 




40 








45 


Val Gly 


Asn Ser 


Gly 


Ala Gly Gly 


Ser 


Gly 


Gly 


Ser Ala Leu Leu Trp 


50 






55 








60 


Gly Asp 


Gly Gly 


Ala 


Gly Gly Ala 


Gly 


Gly 


Val 


Gly Ser Thr Thr Gly 


55 






70 






75 


80 


Gly Ala 


Gly Gly 


Ala 


Gly Gly Asn 


Ala 


Ser 


Leu 


Leu Val 






85 






90 







(2) INFORMATION FOR SEQ ID NO: 2 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

Met Pro Pro Val Ser Ala Asn Ala Met Val Pro Ala His Ser Thr Pro 

15 10 15 

Pro Val Ala Asn lie Glu Val Asn Thr Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 2 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



235 



Lys Pro Asp Arg Pro Ala Ala Thr Val Gly Ser Cys Thr Thr Val Arg 

1 5 10 15 

Ala Pro Cys Ser Gin Pro Val Thr Thr Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 28 0: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

Trp Pro Ala Gly Arg Pro Met His Pro Ala Pro Gly Thr Ser Ala Asp 

1 5 10 15 

His Pro Pro Asn 

20 

(2) INFORMATION FOR SEQ ID NO: 2 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

Val Leu Val Ala Gly Cys Ser Ser Asn Pro Leu Ala Asn Phe Ala Pro 

1 5 10 15 

Gly Tyr Pro Pro Thr lie Glu Pro Ala Gin Pro Ala Val Ser Pro Pro 

20 25 30 

Thr Ser Gin Asp Pro Ala Gly Ala Val Arg Pro Leu Ser Gly His Pro 

35 40 45 

Arg Ala Ala Leu Phe Asp Asn Gly Thr Arg Gin Leu Val Ala Leu Arg 

50 55 60 

Pro Gly Ala Asp Ser Ala Ala Pro Ala Ser He Met Val Phe Asp Asp 
65 70 75 80 

Met His Val Ala Pro Arg Val He Phe Leu Pro Gly Pro Ala Ala Ala 

85 90 95 

Leu Thr Ser Asp Asp His Gly Thr Ala Phe Leu Ala Ala Arg Gly Gly 

100 105 HO 

Tyr Phe Val Ala Asp Leu Ser Ser Gly His Thr Ala Arg Val Asn Val 

115 120 125 

Ala Asp Ala Ala His Thr Asp Phe Thr Ala He Ala 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 282: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 



Met His 


He 


Thr 


Leu 


Asn 


Ala 


He 


Leu 


Arg 


Ala He Phe Gly Ala Gly 


1 






5 










10 


15 


Gly Ser 


Glu 


Leu 


Asp 


Glu 


Leu 


Arg 


Arg 


Leu 


He Pro Pro Trp Val Thr 






20 










25 




30 


Leu Gly 


Ser Arg 


Leu 


Ala 


Ala 


Leu 


Pro 


Lys 


Pro Lys Arg Asp Tyr Gly 




35 










40 






45 


Arg Leu 


Ser 


Pro 


Trp 


Gly 


Arg 


Leu 


Ala 


Glu 


Trp Arg Arg Gin Tyr Asp 


50 










55 








60 


Thr Val 


He 


Asp 


Glu 


Leu 


He 


Glu 


Ala 


Glu 


Arg Ala Asp Pro Asn Phe 


65 








70 










75 80 


Ala Asp 


Arg 


Thr 


Asp 


Val 


Leu 


Ala 


Leu 


Met 


Leu Arg Ser Thr Tyr Asp 








85 










90 


95 


Asp Gly 


Ser 


He 


Met 


Ser 


Arg 


Lys 


Asp 


He 


Gly Asp Glu Leu Leu Thr 






100 










105 




110 


Leu Leu 


Ala 


Ala 


Gly 


His 


Glu 


Thr 


Thr 


Ala 


Ala Thr Trp Ala Gly Arg 




115 










120 






125 


Ser Asn 


Gly Ser 


Thr 


Gly 


Thr 


Pro 


Thr 


Cys 


Ser Arg Leu Trp 


130 










135 








140 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 



Val Leu 


Val 


Ala 


Gly 


Cys 


Ser 


Ser 


Asn 


Pro 


Leu 


Ala 


Asn 


Phe 


Ala 


1 






5 










10 










15 


Gly Tyr 


Pro 


Pro 


Thr 


He 


Glu 


Pro 


Ala 


Gin 


Pro 


Ala 


Val 


Ser 


Pro 






20 










25 










30 




Thr Ser 


Gin 


Asp 


Pro 


Ala 


Gly 


Ala 


Val 


Arg 


Pro 


Leu 


Ser 


Gly His 




35 










40 










45 






Arg Ala 


Ala 


Leu 


Phe 


Asp 


Asn 


Gly 


Thr 


Arg 


Gin 


Leu 


Val 


Ala 


Leu 


50 










55 










60 








Pro Gly 


Ala 


Asp 


Ser 


Ala 


Ala 


Pro 


Ala 


Ser 


He 


Met 


Val 


Phe 


Asp 


65 








70 










75 










Val His 


Val 


Ala 


Pro 


Arg 


Val 


He 


Phe 


Leu 


Pro 


Gly 


Pro 


Ala 


Ala 








85 










90 










95 


Leu Thr 


Ser 


Asp 


Asp 


His 


Gly 


Thr 


Ala 


Phe 


Leu 


Ala 


Ala 


Arg 


Gly 



100 105 110 



237 



Tyr Phe Val Ala Asp 
115 

Ala Asp Ala Ala His 
130 

Gly Lys Leu Val Leu 
145 

Lys Asn Pro 



Leu Ser Ser Gly His Thr 
120 

Thr Asp Phe Thr Ala lie 

135 

Gly Ser Ala Asp Gly Ala 
150 155 



Ala Arg Val Asn Val 
125 

Ala Arg Arg Ser Asp 
140 

Val Tyr Thr Leu Ala 

16 0 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

M= (ii) MOLECULE TYPE: protein 

Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 

00 



PU 



Trp 


Gly 


Ala 


Pro 


Pro 


Ser 


Gly 


Gly 


Pro 


Ser 


Pro 


Trp 


Ala Gin Thr Pro 


1 








5 










10 






15 


Arg 


Lys 


Thr 


Asn 


Pro 


Trp 


Pro 


Leu 


Val 


Ala 


Gly 


Ala 


Ala Ala Val Val 








20 










25 








30 


Leu 


val 


Leu 


Val 


Leu 


Gly 


Ala 


He 


Gly 


He 


Trp 


He 


Ala He Arg Pro 






35 










40 










45 


Lys 


Pro 


Val 


Gin 


Pro 


Pro 


Gin 


Pro 


Val 


Ala 


Glu 


Glu 


Arg Leu Ser Ala 




50 










55 










60 




Leu 


Leu 


Leu 


Asn 


Ser 


Ser 


Glu 


Val 


Asn 


Ala 


Val 


Met 


Gly Ser Ser Ser 


65 










70' 










75 




80 


Met 


Gin 


Pro 


Gly 


Lys 


Pro 


He 


Thr 


Ser 


Met 


Asp 


Ser 


Ser Pro Val Thr 










85 










90 






95 


Val 


Ser 


Leu 


Pro 


Asp 


Cys 


Gin 


Gly 


Ala 


Leu 


Tyr 


Thr 


Ser Gin Asp Pro 








100 










105 








110 


Val 


Tyr 


Ala 


Gly 


Thr 


Gly 


Tyr 


Thr 


Ala 


He 


Asn 


Gly 


Leu He Ser Ser 






115 










120 










125 


Glu 


Pro 


Gly 


Asp 


Asn 


Tyr 


Glu 


His 


Trp 


Val 


Asn 


Gin 


Ala Val Val Ala 




130 










135 










140 




Phe 


Pro 


Thr 


Ala 


Asp 


Lys 


Ala 


Arg 


Ala 


Phe 


Val 


Gin 


Thr Ser Ala Asp 


145 










150 










155 




160 


Lys 


Trp 


Lys 


Asn 


Cys 


Ala 


Gly 


Lys 


Thr 


Val 


Thr 


Val 


Thr Asn Lys Ala 










165 










170 






175 


Lys 


Thr 


Tyr 


Arg 


Trp 


Thr 


Phe 


Ala 


Asp 


Val 


Lys 


Gly 


Ser Pro Pro Thr 








180 










185 








190 


He 


Thr 


Val 


He 


Asp 


Thr 


Gin 


Glu 


Gly 


Ala 


Glu 


Gly 


Trp Glu Cys Gin 






195 










200 










205 


Arg 


Ala 


Met 


Ser 


Val 


Ala 


Asn 


Asn 


Val 


Val 


Val 


Asp 


Val Asn Ala Cys 




210 










215 










220 




Gly 


Tyr 


Gin 


He 


Thr 


Asn 


Gin 


Ala 


Gly 


Gin 


He 


Ala 


Ala Lys He Cys 


225 










230 










235 




240 



(2) INFORMATION FOR SEQ ID NO: 285: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Asp Val Val Glu Ala Ala He Ala Arg Ala Glu Ala Val Asn Pro Ala 

15 10 15 

Leu Asn Ala Leu Ala Tyr 
20 



(2) INFORMATION FOR SEQ ID NO: 2 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 

Leu His Pro Ala Gly Ala Thr Asn Gly Ser Gly Gin Leu Ala Leu Pro 

15 10 15 

Val Glu Ala Pro Pro Arg Ser Val Pro Ser His Gly Glu Pro Leu Gly 

20 25 30 

Ser Ala Ala Pro Glu Gly Leu Glu Gly Glu Phe Asp Asp Arg He Asp 

35 40 45 

Glu Arg Phe Pro Val Phe Ser Ser Ala Ser Leu Ala Glu Ala Leu Pro 

50 55 60 

Gly Pro Leu Thr Pro Met Thr Leu Asp Val Gin Leu Ser Gly Leu Arg 
65 70 75 80 

Ala Ala Gly Arg Ala Met Gly Arg Val Leu Ala Leu Gly Gly Val Val 

85 90 95 

Ala Asp Glu Trp Glu Arg Arg Ala He Ala Val Phe Gly His Arg Pro 

100 105 HO 

Tyr He Gly Val Ser Ala Asn He Val Ala Ala Ala Gin Leu Pro Gly 

115 120 125 

Trp Asp Ala Gin Ala Val Thr Arg Arg Ala Leu Gly Glu Gin Pro Gin 

130 135 140 

Val Thr Glu Leu Leu Pro Phe Gly Arg Pro Gin Leu Ala Gly Gly Pro 
145 150 155 150 

Leu Gly Ser Val Ala Lys Val Val Val Thr Ala Arg Ser Leu 
165 170 



(2) INFORMATION FOR SEQ ID NO: 2 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:287: 

Val Gly Val Val Gly Val Gly Ala Thr Ser Pro Ala Gly Ala Gly Ala 

15 10 15 

Gly Ala Gly Ser Ala Gly Thr Gly Ala Gly Ala Gly Gly Gly Ala Thr 

20 25 30 

Lys Gly Arg lie Asp Ser Ala Ser Ala Leu Ala Ala Pro Leu Ser Thr 

35 40 45 

Gly Leu Leu Ala Val Pro Ser His Thr Thr Asn Gin Arg 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 2 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 



Met Ala Asn Thr Gly Ser Leu Val 

1 5 
Trp Asn Ala Leu Asn Leu Phe Thr 
20 

Asp Lys Gly Gin Ala Glu Ala Val 

35 40 
His Asp Leu Leu Pro Asp Val Leu 

50 55 
lie Thr Thr Ala His Leu Ala Leu 
65 70 
Pro Val Arg Arg Ser Trp Arg Leu 
85 

Gin Gly Leu Asp Lys Ala Glu Thr 
100 

Phe Met Ala Trp Arg Arg Ser Tyr 
115 120 
Arg Gly Ser Gin Phe 
130 



Leu Leu Arg His Gly Glu Ser Asp 

10 15 
Gly Trp Val Asp Val Gly Leu Thr 
25 30 
Arg Ser Gly Glu Leu lie Ala Glu 
45 

Tyr Thr Ser Leu Leu Arg Arg Ala 
60 

Asp Ser Ala Asp Arg Leu Trp lie 

75 80 
Asn Glu Arg His Tyr Gly Ala Leu 

90 95 
Lys Ala Arg Tyr Gly Glu Glu Gin 
105 110 
Asp Thr Pro Pro Pro Pro lie Glu 
125 



(2) INFORMATION FOR SEQ ID NO: 28 9: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

Pro Gly Ser Phe Ala Arg Thr Lys Pro Pro Gly Arg Thr Ala Asp Ala 

15 10 15 

Pro lie Arg Cys Arg Asp Ser Arg Gly Thr Ala Gly His Arg Ala Leu 

20 25 30 

Asp Glu Pro Pro Pro Arg Gly Ser Glu Pro Ala Arg Arg Arg Ser Arg 

35 40 45 

Gly Val Arg Thr Val Val His Asp Ser Leu Ala Ala Arg Arg Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 2 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

Jf (C) STRANDEDNESS : single 

■= (D) TOPOLOGY: linear 

EC (ii) MOLECULE TYPE: protein 

TO (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Ly Gly His Gly Gly Gin Ser Ala He Gly Leu Gly Gly Gly Ala Gly Gly 

s_ 1 5 10 15 

P Asp Gly Gly Gin Gly Gly Ala Gly Arg Gly Leu Trp Gly Thr Gly Gly 

FU 2 0 2 5 30 

ffj Ala Gly Gly His Gly Gly Ala Arg Arg Trp Tyr Arg Gly Pro Thr Ala 

|H 35 40 45 

O Ala Arg Ser Gly Arg His Gly Arg Arg Gly Trp Arg Arg Trp Ala Asp 

fy 50 55 60 

Arg Gin Arg Arg Gly Arg Arg Arg 

65 70 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 91: 

Asp His Arg Arg Arg Ser Leu Ala Ser Leu Arg Ser Ala Ser Ser Pro 

15 10 15 

Ala Arg He Thr Glu Val Arg Pro Cys Thr Pro Leu Leu Glu Arg Ser 

20 25 30 

Ala Pro Gin Ser Gly Ser Arg Asp Pro Phe Arg Pro Trp Pro Ala Asp 

35 40 45 

Ala Gly His Ala Arg Ser Pro Ala Trp Tyr Arg Leu Gly Ala Gly Asn 
50 55 60 
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Pro lie Pro Val Arg Ala Ala His His Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 2 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 

CCGCACGTAA CACCGTGAAT TGAAGGGAGC CGCTGGTCAT GGGCCGATTC TATCCGTGGG 6 0 

CGAACGGTTA TTGACGGCCC GGAGGCCACT CCGCTGCCAC CAAGTGGTGA CTCAGCGCGT 12 0 
H TTTCACGGCA ACGAACGGCG GACACACCAC TTGACATTCG ACAGCACGGC CGCG 174 

" 3 

Q (2) INFORMATION FOR SEQ ID NO: 2 93: 

m 

=C (i) SEQUENCE CHARACTERISTICS: 

£0 (A) LENGTH: 404 base pairs 

=p (B) TYPE: nucleic acid 

|ij (C) STRANDEDNESS: single 

s (D) TOPOLOGY: linear 

D 

fy (ii) MOLECULE TYPE: cDNA 

lf\ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 93: 

jft TCGCAAACGG GGTGACGTTC CGTCCGGTGG CGCTAGAGAG TTTGTCGCAC TTTCCGGTGA 60 

CCGTCGCCGC GCACCGCAGC ACCGGTGAGC TCACGCTGCT AGTGGAGGTG CTCGACGGTG 120 
CGCTGGGCAC GATGGCGCCC GAAAGCCTCG GCAGGCGGGT GCTGGCTGTG TTACAGCGCT 180 
TGGTCAGCCG GTGGGATCGG CCGCTGCGCG ACGTCGACAT TCTGCTGGAC GGCGAGCACG 24 0 
ATCCGACCGC ACCCGGCCTG CCGGATGTGA CGACGTCGGC ACCCGCGGTG CATACCCGGT 300 
TCGCCGAAAT CGCTGCGGCA CAGCCTGACT CGGTGGCGGT CAGTTGGGCG GATGGTCAGC 36 0 

TGACGTACCG GGAGCTGGAT GCATTGGCCG ACCGGCTGGC CACT 404 

(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

Ala Asn Gly Val Thr Phe Arg Pro Val Ala Leu Glu Ser Leu Ser His 

15 10 15 

Phe Pro Val Thr Val Ala Ala His Arg Ser Thr Gly Glu Leu Thr Leu 
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Leu Val Glu Val Leu Asp Gly Ala 

35 40 
Leu Gly Arg Arg Val Leu Ala Val 

50 55 
Asp Arg Pro Leu Arg Asp Val Asp 
65 70 
Pro Thr Ala Pro Gly Leu Pro Asp 
85 

His Thr Arg Phe Ala Glu lie Ala 

100 

Val Ser Trp Ala Asp Gly Gin Leu 
115 120 
Ala Asp Arg Leu Ala Thr 
130 



Leu Gly Thr Met Ala Pro Glu Ser 
45 

Leu Gin Arg Leu Val Ser Arg Trp 
60 

lie Leu Leu Asp Gly Glu His Asp 

75 •' 80 

Val Thr Thr Ser Ala Pro Ala Val 

90 95 
Ala Ala Gin Pro Asp Ser Val Ala 
105 110 
Thr Tyr Arg Glu Leu Asp Ala Leu 
125 



(2) INFORMATION FOR SEQ ID NO: 295: 

?j (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
U (B) TYPE: nucleic acid 

03 (C) STRANDEDNESS : single 

4= (D) TOPOLOGY: linear 

m 

=fi (ii) MOLECULE TYPE: cDNA 

yy 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 95: 

fy GCTTCGACGG CTACGAGTAC CTGTTCTGGG TGGGTTGTGC GGGCGCCTAC GACGACAAGG 60 

fU CCAAGAAGAC CACCAAGGCC GTCGCCGAGC TGTTCGCCGT CGCCGGGGTG AAATACTTGG 12 0 

jjfi TGCTGGGCGC TGGGGAAACC TGCAACGGCG ACTCGGCGCG CCGCTCCGGC AACGAGTTCC 180 

Q TCTTCCAGCA GCTGGCACAA CAGGCCGTCG AGACC CTGGA CGGTTTGTTC GAGGGTGTGG 24 0 

fy AGACCGTCGA CCGCAAGATC GTTGTCACCT GCCCGCACTG CTTCAACACC ATCGGCAAGG 30 0 

AATATCGGCA GCTGGGCGCC AACTACACCG TGCTGCACCA CACCCAGCTG CTCAATCGGT 360 

TGGTGCGCGA CAAGAGGCTG GTCCCTGTCA CTCCGGTTTC TCAGGACATC ACCTACCACG 42 0 

ACCCGTGCTA CCTGGGTCGG CACAACAAGG TCTACGAGGC ACCACGGGAG CTGATCGGTG 480 

CCGCGGGGGC CACCTGAGCC GAGATGCCGC GCCATGCCGA CCGCAG 526 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 96: 



CTCGCCGCCG TGATCTGGCC GGCGAACTTC GTCAGTGCAT CCAGACCCCA ACGATCATCG 6 0 

ATCAGGCCGA TGCCCATGAT CACCGCACCG GCCACCAGCA CCGCGGGCAT GCCGGTGGAA 12 0 

TAGACGAACC CCCGGGTGAG TGCCGGAAGC TGGGAGGCAA GAAAGACGGC GCCGACAATG 18 0 

CCCAGGAACA TCGCCAACCC ACCCATCCGA GGGGTAGGCG TGACGTGCAC ATCTCGCTCC 24 0 

CGCGGGTAGG CGACGGCTCC CAGGCGACTG GCCAGCATCC GCACCGGACC GGTCGCAAAA 30 0 

TAGGTGATGA TCGCCGCGGT CAGCCCGACC AGCGCAAGCT CACGCAGCGG GACAC CGGCG 36 0 



243 



CCGCGATAGG ACAGGGCGAG CAAGCCACCG GCAACGCCGG CCACATCGCT GGACACCTCG 420 

AGACCGTACT GCACCAACCT GAAGAGCTGA ACACTCGCCG AACGTGCAAC AGCTGCGAAC 480 

AATTGGG 487 

(2) INFORMATION FOR SEQ ID NO: 2 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

ACGAAGCGCG AGAATATGAG CCGGGGCAAC CCGGCATGTA CGAGCTTGAG TTCCCGGCGC 60 

CTCAGCTGTC GTCGTCCGAC GGCCGTGGTC CGGTGTTGGT GCACGCTTTG GAAGGTTTCT 120 

CCGACGCCGG CCATGCGATC CGGCTGGCCG CCGCCCACCT CAAGGCGGCC CTGGACACAG 180 

AGCTGGTCGC GTCCTTCGCG ATCGATGAAC TACTGGACTA CCGCTCGCGG CGGCCATTAA 240 

TGACTTTCAA GACCGATCAT TTCACCCACT CCGATGATCC TGAGCTAAGC CTGTATGCGC 3 00 

TGCGCGACAG CATCGGCACC CCATTTCTGC TGCTGGCGGG TTTGGAGCCG GACCTGAAGT 36 0 

GGGAGCGGTT CATCACCGCC GTCCGATTGC TGGCCGAGCG CCTGGGTGTA CGGCAGAACC 42 0 

ATCGGCCTGG GCACCGTCCC GATGGCCGTT CCGCACACAC GAC CGATCAC GATGACCGCT 48 0 

CATTCCAACA ACCGGGAGCT ATCTCCGATT TTCAACCGTT CGATCTCC 52 8 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

CCAAGCCCGT CAAGGAGCCG GTGCCGGCCT TGCCTCCGGT GCCGCCGACG CCGGCGTTGC 60 

CGCCGTTGCC GCCGTTGCCG CCGGTACCGG GGTTTCCTAC GGTGCCGCCG CCCGGCAGCA 120 

TGGCCCCGCT GTTTAGGCCG TTTTCGCCGG CCCCGCCGTC AC CGGCTTTG CCGCCATCGC 180 

CGCCGTTGCC GCCGCTGGTG GGGGTGGCGG CCTGGTTGAC GTATTGTTCC ACCGGCCCGG 240 

CCCTTGACCC TTTGGCGGTG TCGATCGCGG CGTCGATGGA TCCGCCGACC ACGACGTGCG 3 00 ' 

AAGCCTCGCC TGCCGCCGCA GCCGCCCAAC TGTGTCGCGG CTCCTGCGAT TTGGCCCCGG 3 60 

CCGACGAGAT GATGGGCACC ACCGGAGCCT GCGGCCGTCT GGGGGAGGCC AGCGCGGGTT 420 

CGCGGTCACG CCATACGCGA CGGTGCGCCG CCGCTTCGGA GATTTGCAGG CTGCGTTGCA 4 80 

CCAGATCGAG CAGCGGTGTG CCCAGGGACT GGGTTAGCCC GTTGGCGCCG CCGTTGTAGC 540 

GGCGAGCGCA ATATCGGTGC CCACTCGACC CAACCGCGAC TCCATAAGCG ACACCATTCG 6 00 

CGGTTGATGC 6 10 

(2) INFORMATION FOR SEQ ID NO: 299: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:299: 



Phe Asp Gly Tyr 


Glu 


Tyr 


Leu 


Phe 


Trp Val Gly Cys 


Ala Gly Ala Tyr 


1 


5 








10 




15 




Asp Asp Lys Ala 


Lys 


Lys 


Thr 


Thr 


Lys Ala Val Ala 


Glu 


Leu Phe 


Ala 


20 










25 




30 




Val Ala Gly Val 


Lys 


Tyr 


Leu 


Val 


Leu Gly Ala Gly Glu 


Thr Cys 


Asn 


35 








40 




45 






Gly Asp Ser Ala 


Arg 


Arg 


Ser 


Gly 


Asn Glu Phe Leu 


Phe 


Gin Gin 


Leu 


50 






55 




60 








Ala Gin Gin Ala 


Val 


Glu 


Thr 


Leu 


Asp Gly Leu Phe Glu Gly Val 


Glu 


65 




70 






75 






80 


Thr Val Asp Arg 


Lys 


He 


Val 


Val 


Thr Cys Pro His 


Cys 


Phe Asn 


Thr 




85 








90 




95 




lie Gly Lys Glu 


Tyr 


Arg 


Gin 


Leu 


Gly Ala Asn Tyr 


Thr 


Val Leu 


His 


100 










105 




110 




His Thr Gin Leu 


Leu 


Asn 


Arg 


Leu 


Val Arg Asp Lys 


Arg 


Leu Val 


Pro 


115 








120 




125 






Val Thr Pro Val 


Ser 


Gin 


Asp 


He 


Thr Tyr His Asp 


Pro 


Cys Tyr 


Leu 


130 






135 




140 








Gly Arg His Asn 


Lys 


Val 


Tyr 


Glu 


Ala Pro Arg Glu 


Leu 


He Gly Ala 


145 




150 






155 






160 


Ala Gly Ala Thr 



















(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE 


TYPE: protein 




(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID NO: 300 : 




Arg Arg Arg Asp 


Leu 


Ala Gly Glu 


Leu Arg Gin Cys 


He Gin Thr Pro 


l 


5 




10 


15 


Thr He He Asp 


Gin 


Ala Asp Ala 


His Asp His Arg Thr Gly His Gin 


20 






25 


30 


His Arg Gly His 


Ala 


Gly Gly He 


Asp Glu Pro Pro 


Gly Glu Cys Arg 


35 




40 




45 


Lys Leu Gly Gly 


Lys 


Lys Asp Gly 


Ala Asp Asn Ala 


Gin Glu His Arg 


50 




55 


60 




Gin Pro Thr His 


Pro 


Arg Gly Arg 


Arg Asp Val His 


He Ser Leu Pro 


65 




70 


75 


80 


Arg Val Gly Asp 


Gly 


Ser Gin Ala 


Thr Gly Gin His 


Pro His Arg Thr 




85 




90 


95 


Gly Arg Lys He 


Gly 


Asp Asp Arg 


Arg Gly Gin Pro 


Asp Gin Arg Lys 
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100 105 110 

Leu Thr Gin Arg Asp Thr Gly Ala Ala lie Gly Gin Gly Glu Gin Ala 

115 120 125 

Thr Gly Asn Ala Gly His He Ala Gly His Leu Glu Thr Val Leu His 

130 135 140 

Gin Pro Glu Glu Leu Asn Thr Arg Arg Thr Cys Asn Ser'Cys Glu Gin 
145 150 155 * 160 

Leu 



(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

M 

C3 (ii) MOLECULE TYPE: protein 



m 


(: 


<i) i 


3EQU] 


3NCE 


DESCRIPTION 


: SEQ ID NO: 301: 


































Glu Ala 


Arg 


Glu 


Tyr 


Glu 


Pro 


Gly 


Gin Pro Gly Met 


Tyr Glu Leu Glu 




1 






5 








10 






15 


., 


Phe Pro 


Ala 


Pro 


Gin 


Leu 


Ser 


Ser 


Ser Asp Gly Arg 


Gly Pro Val Leu 








20 










25 




30 






Val His 


Ala 


Leu 


Glu 


Gly 


Phe 


Ser 


Asp Ala Gly His 


Ala 


He 


Arg Leu 






35 










40 




45 








Ala Ala 


Ala 


His 


Leu 


Lys 


Ala 


Ala 


Leu Asp Thr Glu 


Leu 


Val 


Ala Ser 




50 










55 




SO 










Phe Ala 


He 


Asp 


Glu 


Leu 


Leu 


Asp 


Tyr Arg Ser Arg 


Arg 


Pro 


Leu Met 




65 








70 






75 






80 




Thr Phe 


Lys 


Thr 


Asp 
85 


His 


Phe 


Thr 


His Ser Asp Asp 
90 


Pro 


Glu 


Leu Ser 
95 




Leu Tyr 


Ala 


Leu 
100 


Arg 


Asp 


Ser 


He 


Gly Thr Pro Phe 
105 


Leu 


Leu 
110 


Leu Ala 




Gly Leu 


Glu 
115 


Pro 


Asp 


Leu 


Lys 


Trp 
120 


Glu Arg Phe He 


Thr 

125 


Ala 


Val Arg 




Leu Leu 


Ala 


Glu 


Arg 


Leu 


Gly 


Val 


Arg Gin Asn His 


Arg 


Pro 


Gly His 




130 










135 




140 










Arg Pro 


Asp 


Gly 


Arg 


Ser 


Ala 


His 


Thr Thr Asp His 


Asp 


Asp 


Arg Ser 




145 








150 






155 






160 




Phe Gin 


Gin 


Pro 


Gly 
165 


Ala 


He 


Ser 


Asp Phe Gin Pro 
170 


Phe 


Asp 


Leu 
175 



(2) INFORMATION FOR SEQ ID NO: 3 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 





Lys 


Pro 


Val 


Lys 


Glu 


Pro 


Val 


Pro 


Ala Leu Pro Pro Val Pro 


Pro 


Thr 




1 








5 








10 


15 






Pro 


Ala 


Leu 


Pro 
20 


Pro 


Leu 


Pro 


Pro 


Leu Pro Pro Val Pro Gly 

25 3-0 


Phe 


Pro 




Thr 


Val 


Pro 
35 


Pro 


Pro 


Gly 


Ser 


Met 
40 


Ala Pro Leu Phe Arg Pro 
45 


Phe 


Ser 




Pro 


Ala 
50 


Pro 


Pro 


Ser 


Pro 


Ala 
55 


Leu 


Pro Pro Ser Pro Pro Leu 
60 


Pro 


Pro 




Leu 


Val 


Gly 


Val 


Ala 


Ala 


Trp 


Leu 


Thr Tyr Cys Ser Thr Gly 


Pro 


Ala 




65 










70 






75 




80 




Leu 


Asp 


Pro 


Leu 


Ala 
85 


Val 


Ser 


He 


Ala Ala Ser Met Asp Pro 
90 


Pro 
95 


Thr 




Thr 


Thr 


Cys 


Glu 
100 


Ala 


Ser 


Pro 


Ala 


Ala Ala Ala Ala Gin Leu 
105 110 


Cys 


Arg 




Gly 


Ser 


Cys 


Asp 


Leu 


Ala 


Pro 


Ala 


Asp Glu Met Met Gly Thr Thr 


Gly 








115 










120 


125 








Ala 
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£ 
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160 


m 
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Ser 


Gly 
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Pro 
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Q 

ffj (2) INFORMATION FOR SEQ ID NO: 303: 

I'fi (i) SEQUENCE CHARACTERISTICS: 

i (A) LENGTH: 921 base pairs 

="1 (B) TYPE: nucleic acid 

§y (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

AATTCGGCAC GARCAGCACC AACACCGGCT TCTTCAACTC CGGCGACGTC AATAC CGGTA 60 

TCGGCAACAC CGGCAGCTTC AACACCGGCA GCTTCAATCC GGGCGATTCC AACACCGGGG 120 

ATTTCAACCC ANGCAGCTAC CACACGGGGA CTCGGAAACA CCGGCGATTT TACACCGGCS 180 

CCTTCATCTC CGGCAGCTAC AGCAACGGGT CTTGTGGAGT GGAAATTATC AGGGCTCATT 240 

GGNTGCACCC GGS CTTRCGA ATCCCTCGKG CCAATTCAAC TCCTCNACAA GCTTGCGGCC 3 00 

GCACTCSAGC CCGGGTGAAT GATTGAGTTT AACCGCTNAN CAATAACTAG CATAACCCCT 360 

TKGGGC CTCT AAACGGGTCT TGAAGGGTTT TTTGCTGAAA GGANGAACTA TATCCGGATA 420 

ACTGGCGTAN TACGAAAAGC CGCACCGATC GCCTTCCCAA CAGTTGCGCA CCKGAATGGC 480 

AATGGACCNC CCTKTTACCG GSCATTAACN CGGGGGTGTN GGKGTTACCC CCACGTNACC 540 

GCTACCTTGC CANNSSCCTN RSGCCGTCTT TCSTTTCTTC CTTCCTTCTC CCMCTTCGCC 6 00 

GGTTCCCNTC AGCTCTAAAT CGGGGNNCCC TTTMGGGTTC CAATTATTGC TTACNGSCCC 660 

CCACCCCAAA AAYTNATTNG GGTTAATGTC CCTTMTTGGG CNTCCCCCTA WTNANNGTTT 72 0 

TCCCCCTTNA CTTTGRS TCC CTTCYTTATW NTGAMNCTNT TTCCACYGGA AAAMNCTCCA 780 

CCNTTYSSGS TTTCCTTTGA WTTATMRGGR AATTS CAATY CCGCYTTKGG TTMAANTTAA 840 

CYTATTTCNA ATTTTCCCGM TTTTMMNATR TTNSNCKCGM KNCTCCNRKA SSGNTTTCCT 900 

CCCCCYTTSS GKTYCCCCRN G 921 
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(2) INFORMATION FOR SEQ ID NO:304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 

AATTCGGCAC GAGATANGGG CGCACCGGGG TCCGCAGCCG GCGGGACCGT CGCCAGCACC 60 

ACCGGGGTCA ACAGCACCAC GGTGGCGTCC ANGCAGAGCG CCGCGGTGAT GGCGGCCGAG 120 

ACGGCRAACA CCTGCCGTAG CAGTCGGTGC GACTCCGCGC TCGCTCGANC CATGGCCGCG 180 

CCGGCTGCCT CGAACANGCC TTCGTCGTCC ACAGCTTAGC CAGCANCCAA ACCGCACCCA 240 

GAAACCCACA CGCCCGCCGC CCCGGANACC TGCGCCATCG KCTGCTGGGG CGANATCCCC 3 00 

CGATCGCTNA CANGATGACC GCTGCCGGAA CGCCGCCGCT GCCTCCGGGC AGCCGCGTGG 360 

GCSGGGCAAC CGCGAACCCA NGAACACGGC AAGCAGTATC ANCGCAACAG CAATTGTCAA 42 0 

GGGCTAAACG CTTCACATCC AGGGATCTCG CGGCGCCACA CCGTCGGMTC TGCAGSGCGA 4 80 

CCCCNTCCTN GGGCGGNCAC TCNTCAAAGA TGCNGATCNA CAGKCTAGGT CTTCGGCCGA 540 

TATGSAAGGN CCCAACGGNT TTAAAGCGGC SAAAAAASTC TCCCANTGGA TAAAATCAGC 6 00 

CGGGGANCCC CCCGTGSCMM NGTCYCGGKC ATTNTTCAAC MGGTTTNACG GCGGKTGCNG 660 

GCCAACTKGC CAAAMTTAAG KTNGGGGNTY CGGGGCGGTA AC CGGCNNTK NGCCCCTTAA 720 

AAAACCGGNC YTTTCTKGAT TAMMACCGGN CCCCCAWTGG CGGKTGKTCC CANGNTYAAC 780 

AMCCYCCCSS MNGGGKTGGS SAACCCTTCC CGNGGGGTTC NTKGTTSCYT AWMCCCCCGG 840 

AAACCSGKYG GGKT GGCRTN WASSAMNCCC CMNGYYTCTT T AAAGGC CAN KNRAAWGKYT 900 

CCTTGGGAAW CCTNCAATYC GAAAAYYCTC CTYMMGSSCN CTTKCWRTYN NRNGGGAACS 96 0 

AMWTNYC CNC GWTTCAWTCG GGTCCGASMN AAACKCTTTY TTTTYCGSSC STCCMGGSNC 102 0 

SGGTKNANAN AAASATTTMC YYCNNNANKK YYYCSSGCTT CYKMGRRNRR GMGAACCCGR 108 0 

GS 1082 

(2) INFORMATION FOR SEQ ID NO: 3 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: 

AATTGGCACG AGTGATCGCG CTGAAGCCGG TAGCGCGGGT GGCTCGGGTG GTTTGCGAAC 60 

RAAATCCGCT CGANGTGGTC TCGGTAGGCG GTGTCCANAA CGGTGGCGCG GTGCCGGCGG 120 

ATCTGATCGG CGCGGCCGTA GTGCACGTCG GCGGGCGTGT GCAGTCCGAT GCCGGAATGC 180 

TTGTGTTCGT GGTTGT AC C A GCCGAAGAAC CGGTCGCAGT GCACCCGGGC CGCCTCGATC 240 

GACTCGAACC GTTTCGGGAA ATCGGGCCGG TACTTGAAGG TCTYGAACTG GGCCTCAGAC 3 00 

AACGGGTTGT CTTGCTGGTG TGCGGGCGTG AGTGCGACTT GGTGACACCG AAGTCGGCCA 360 

NCANCAATGC CACCGGTTTG GAACTCATCC ACAACCCCCG TCCGCGTCMA GGTCAC TTGT 420 

NCGGCGCTAA TTTNYTGGGC GGCAAGGGTT TGCCGAYCAN KCCGCTCGGC CAAAACTTCG 4 80 

ANTCNCSCCA AGGCCNCCAT CCNCCCAAAC AMGTTACGGG ANAAAANATY CAAAGAYCAC 540 

CYTCCGGKTN TTATANCTYC CCYTTTGSTY GGGCCCCCCN CYYTGKKNAT ACCCCTNCCA 600 
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AWTCCCAACN CCCKCCAANA RCYKGGGGCC CCCNCCAACC CGGGKGAAKA WTAATTTAAA 660 

CCCYAACMAW ACTWMMNACC CNNGGGSCCY AAMCGTYYNR AGGTTTTSCT NAAAGAAASA 720 

ANTCGGAAMC CGGNTSTACC AAAAASCCCK CCNWTCCCTC CRASATTGSC NCCSAAWKSA 780 

AKGCCCCCNY TCSGCNWNNC CSGCGGKKKT KKGTTNCCCT WMRCWMWYTS GGCCNASCCN 840 

CKYYSSMYCC CCCCTCCCCM CTCCGNKTCC CCAMCCYANC MGGCCCCYTM GKKCCCWKNT 900 

YKGCCCCCCC AMMNNNGGGG WGACCCTNGG CCCCMKRRGM TCCCNANTGA MCCTCWGNRA 960 

MKCYCCNRAR ANMCCSCNCC NGCNCRCKNN 990 

(2) INFORMATION FOR SEQ ID NO: 3 06: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE : nucleic acid. 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306: 

AATTCGGGTG GCAACGCGGG CCTGTTCGGC AACGGCGGCG CCGGTGGTGC CGGTGGGGCT 60 

GGTGGTGGCG CCGGCGGCGC GGGCGGTAAC GCGGGGTGGT TTGGTCATGG GGGCGCTGGC 12 0 

GGCGTGGGTG GTGTANGTGC GGCCGGGGCC AACGGTGCTA CGCCCGGTCA GGATGGGGCG 180 

GCTGGTGTTG CCGGGTCGGA CRACRCTCGT GCCGCTCGTG CCG 223 

(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

AATTCGGCAC GANGCGGCAA CGGTGGCAGC GGCGGCACGT CNGTTGC CAC CGGGGGGGCC 60 

GGGAACGGCG GTGC CGGCGG CGCCGGCGGC GGGGCCGGGC TGATCGGCAA CGGCSGCAAC 12 0 

GGCGGCAGTG GCGGAATGGG CGATGCCCCG GGCGGCACCG GCGTCNGCGG CATCRGTGGG 180 

CTGTTGTTGG GTTTGGACRG CGCCAACGCC CCGGCCAGCA CCAACCCGCT GCACACCGCG 24 0 

CAGCACAGGC GTTGGCCGCA GTCAACG CGC CCATCCAGGC CGTGACCGGG CGCCCCTGAT 30 0 > 

CGGCAACGCG CCAACGGCGC CCCGGGCAAC GGGGCCCCCG GCRGGCACGG CGGGTGGTTG 360 

TTCGGCGGCG GAAGGAACGG CGGGTCCGGC GTCANCRGCG GGGCGGGCGG AAATGCCG 418 

(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

AATTCGGCAC GAGGGGCACG ATCG CATACA GCGCTCGCGG CAGACCCGCC CGATACAGCA 60 

GCTCGGCACA CGCGAGCGCA CAATACGGCG TCTGGCTGTC CGGCTTGARC ACCACCGCGT 120 

TACCGGCCAC CAGCGCGGGC ACCGAGTCCG ACAC CGTAAG CGTCATGGGG TAGTTCCACG 180 

GCGAGATCAC CCCCACCACG CCCTTCGGTT GATAGCACAC CGTGGTCTTG CCTATCCCGG 240 

GCAGCAGCGG CTGTGCCTTA CGGGGCTTCA GCAGGTCCAC ACAGACTCGT GCSTTATAAT 3 00 

TNCGCSTTCC GCGATCAGAT CGACAATTTC CTCTTGCGCC GCCCATCGGG CCTTGCCCGC 360 

CTCGGCTTGC AGGAAGTCCA TGAAGAACTC GCGGTTCTCG ATNAACAGGT CGCGATAGCG 420 

GCS GATGACT GCAGCTCGCT CGATNACGGG ACCTTCGCCA GTCGGTCTGC GCCGCGCGAN 480 

CTTCCGCGAA TGCCGCTTCG ACTTCCGCGG NCGTGCCAAC GGAATCNTAT CACGGGTTGC 540 

CGGTTAAAAC TCCTCAATST NCYGGTCGAA ATTCGGCAAC TTCTTATCCC GGCAGGTRCC 600 

AACSANNCAA ACCTCGGCAA GGTTAGGMTT TCCCCCNCTT YCAAAAATNC GGKTTTTGGN 660 

CMAATTTCGC CKCNATGKTG MCAAGGMTCT CKAANAAKCS GGGTCYTCTN NTCNGKGGAK 720 

CCAAAMGGKT TTGGGGMAGC GKNMNCCAAN CCTWACCCTG KTKAANGGNW TTCCCCCCGG 780 

GGGAKKGNGA ATYCYCCSNA NCCCRGGGGG GNMCARATTC TYCCGGMCTC CTCKGGAWTC 840 

WGMGSTTTCC CAAAAAACSC CCCAAATTMM TTTTTC CRCN TRTTGANACW CTTTTKARCA 900 

MMCSSAARNS ANMCNCTCYC CKCTKTGKTK AAAAAGNAYW CCCCMAAATT TYTAWTTSSC 960 

CCSCGCGGGN CCCNCTNTTT TSCNMTWCTM WNYTNCRMCC MMMSNCKSNG KKGGNRC CNN 1020 

CRCCSNCCCM AAWYNTKGYN KNTATMAGC 1049 

(2) INFORMATION FOR SEQ ID NO:309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

AATTCGGCAC GAGGGAATCG AGAATCCCGG AATGGTGAAG CCTCGGTGCC TGCCGTTACG 60 

CCAAGAKTCA GGGTGAGCGG CCCCCCGGTG GGAATGCTGA SGCCAACCGG GAAAAGGGTG 120 

AGGGCTGGGG TGGAATAACT GAANGTTACT GGGATGGAAA ACCCGGTATT GATATGTATT 180 

GGGCCGATCA ANGTTGTGGG AATGGGGGAA GGCTGAGGGC GACCTGTTGG ATTTGGGGAA 240 

TTGTYRTGGA CRAKACWGGC CAGCCMGCGT GATGGTTTGG TTSAANTTTT GTGCCGSCCA 300 

CANGGTGATG GGATTGATTT TGATGGGGCC SATCGAAATA TTGGGTATGC CNACGCCSAA 360 

CGAGATYGCC GGGACGTTCA TGGGCGGGAC AACCMASGGT CCSANGTAAK GGTTTCCTTN 42 0 

ATNTTGATCG GGATTCCGGA ACTMTS TCGA TGSGCTCSAY MTSATSGCCC NACNCCWCCG 48 0 

YTTATTTCMS GCTNAYGGGA ATBAMRGGAA CAAYNTCCCT CCCMGGAAAA ACCAACMSGC 54 0 

CCTGGTNSYC CNCCCRCCNC AKAACCCRTT KCTGTRS TMC CCSMAAATNA CSCCCSCTTS 600 

NACTCCNCSG AANTNSCCCC CCCSCKNNTT ATSTYCCCGK GTTCCCCCMC CCCTTNAAMC 66 0 

TCCCCGGTTA ACCCCCWTNT SNCNCCCCCS YTAAKMNCRG GCTTS TTNCT CCCCCYTRMK 72 0 

CNCCCCCTCK SAMCWNCCNC CTCKAACNAC CCCKCYKGSM TNCCCAATNT WCMWCKCCNS 780 

KTTNTMCTKC CCAAYTNCRC CCNCRCTCCC CCKSTSTCAM WTATAAAACC WCWYAWYNNK 84 0 

KCNCWMAWTA MGACWCTCNY NCCCCNCNCK NTTKTAMWCC CKMCCCKCSW TWCYCKCSCC 900 

CCMTCTMNAC YCCCCCKKTY NKWMCCCTTC CCCCCCTCCC MCNMBMKTCT YCSGKTWCWC 960 

NCYNTTMTCN CYNANMCKCK KTCTCTTCCN CRNTCTCCCC CCWCCCCCCV KKCTCTSKCC 102 0 

CNCNCTCCSC MMKGSC 1036 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

AATTCGGCAC GAGATCATGA ATAGCGGGCT GGTCAGCACC GAAGTGGTCG GCGATCTCGC 60 

GAGCAAGTCT CGTCTGCTCG CCCAGCAGGA GGTCGGCATC GATGCGGACA CCTGCGATGT 120 

CTTGGATGGT GTTCAGTTGC AGGTAAGGCC GACGCCGCAG CTTTGCTAGC AGGGTGTCTT 180 

GGCTCTTCGC ACGTGAGGTA ACCAATAACT CCGACGCAGA CCAACTCCGG CCCTCGATCC 240 

GGGTACCAGG CTCCGCCGGA GCCAGCCGTT GTGCCCCCTG GGCCGAAGGT CAGCTGCTGT 300 

GCGATCGAAG TAAGAAACCG CGCCATGCCC GTCGCCAAGT ACGACTGACC GAGCAAACGA 360 

ACGATCGTCG TCCTTTCCGT GGGGGTAATC GANCCCAGCA ACCGCACGAG CCACCAATCA 420 

TTGGGATTCG GCCACTGACC GACCAACCGC CTGTGCGACA CCCCAGCGGA ATTGGTGGTC 480 

TTCCGCGGGG CCGCNAACGG AATCANCGSG ACGCGCTCGC CGAASCANCC GCATANC CNT 540 

ACATANCAAC GGNNTCTGCG CCCACATTTC GGGS TTMTGC CCCTCNGCAA CSSNAAYNCC 6 00 

CCCAATTCYG AACNAAAAAA TTGGYCCATY ARNGTYCTCM CCAAAAACCN AWTCCCCKTA 660 

TCCCCCGGGG GGGRCCCCYY NMNAAAACGG CCCWWAANCC CCSGGGCSCC CGGGTTRWTN 72 0 

CCCCTTGTCG GCCCNCCSGG TTTGGTCMCM GGSCMMTNWN GGGNTGCSCC CCCNCNAAAA 780 

AAAAAYCKNG NCAAATYAAA CCCKYCMAAA ASKTGGGSSC CCCMARCCGG GGKAAKKWWA 840 

ANTTAANCCN KAAAAAAAWW NCANNMCCCC NGGGNCCTAA GGKYTTAGGG GTTS TTNANG 900 

ARAAAATMTC CANATMNSSK TTNNAAAAAA ASCCSWAKCC CCCNNNKKNN CCAAWKAARR 960 

SRCCTTCGGG TNWNSGGGGG KKKKKTNCMS KMNMMTTWGR CCCNCCGCCN NNTWKCCTTN 102 0 

TCCNYGGNGC RNCAGN 1036 

(2) INFORMATION FOR SEQ ID NO: 3 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

AATTCGGCAC GAGTCGATTC GATCGAACAC GCCCGCACCT GGCCAGGCCA CATGGGCGCG 6 0 

GCCATGGCCA ACGCCTACTC GGCCAACCCG AATCCATTCG GCGTCTCACC GCAACCCCCG 12 0 

AAACCGGCGA CCGCGGCATG GATCAACCCG CCCACCCCAG ATC CGAAATA GCGTCCACAT 18 0 

AATGAGACAC TGGCGCAAAG AGCTTGACAG GCGCCGCACC ACGCAAGCTG TTAGACGTGT 24 Q 

CGGTCTTGCA AGAAGCGGGT TGGCCACCCA AGATCACGCC GCCCAAGGGC ATCGAGTCAA 3 00 

CGTTGCGGTG GTATCGCGCT AACGTCGGCG CCGCCAAGAA ATGACGGTGC GCATTACCAT 360 

GGCCCTGCTG ATCACCTTTG GCCACCTGCG CACCANAACT ATGANCAGCC TTATGCCGAG 420 

TCTCGTGGAC ATCGGCAGCC GCTTCAAAAA CTCCTTGTCG ACAATSGTAT TGCTGANCCG 480 

CCGAATTCTT NTRCTTGCAA SAACACTNCA TGTTNCSGGT NAACAACCYT GGTTNGAAAA 540 

ACANCCAATA TTGAANTCCC ANTCGGGCAM GAACCNGTTM CGGAAGKTGK TGGGAACGAA 600 

TGKTGCCCAA AAATCCCGGG NGGTRAAAWW CCCNSNATGG MSAATTTTSC CTNGAACAAM 660 

AAAAGGTCCA AGKYCAAAGG NGCCCCCCCC SGNAAATTGG TGAACSCAKA WYANRTTCCC 720 

WWWTNCAAAT MTTNGGGTCC KNNTCCCCWT AAANGGGSCN CCCCNCCRGG GMGTYTCCCC 780 

NWNMGGGMGN CYYCSCCCCA AAAAAAAMMM MTTTCSGKGG SMGGKKCCCC CCSGGTYWGG 840 

GKKYTTAAAC CCGGKGGGTN CAAAAAANAN ACCCCCCAMS NGGGGGGAAA ATTTGNAAWT 900 
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AAGGKKKTKC SCMACCCCAA AAANMMNNCN AWNCCCGMGK SARGGGGRNY TTMKAGGGMG 
GNYCCCCCCW YCGGGGGGNA NAAYAAAAGK NGSNGRGAAT NTTNTTTTGK RSSSRNKTTT 
TYNTCCTYCN CCNMGNRWWG SRAMNTGKTS NSSGGGSGGC 



960 
1020 
1060 



(2) INFORMATION FOR SEQ ID NO: 3 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



AATTCGGCAC 
GCAATACGGG 
CCAAGGTGAA 
TGTTNGTATG 
ATGGATGGTC 
TTCCCGCTAC 
CAAGCGAMTC 
CGAATGCGGC 
CCGNGNMNTG 
GTTGGACTTA 
SAANTCGGGC 
SAAACNTTTG 
TNTTTNCAAW 
AAAAAMCMSA 
TNGNGMGGGC 
NCTGMGGGKT 



GAGCTTCACC 
CATGGATGAN 
ACGCTTTGCG 
GATGCTGGAA 
CKSTTTTCNC 
RCTGCAGCCC 
GGGCATSCNC 
GCTYAAAAGC 
CGTTCTCTCC 
MRTNTTCAAA 
TTYGGGNRCC 
CCRTNCYMNN 
TCNCNS KTTT 
RRCCNCYGGG 
SMACCSGNGN 
TTKGNNGARN 
CRNNGANRCC 
CGYMTNKCGC 



AAAGAGCTGA 
CCGAANGGAN 
GCGAAAGATG 
CCGCGCTGAC 
TCCGCSGTTA 
AT C ATGGATG 
GCGGCAMTTT 
NGGCTTGCGT 
AACTC CGKKG 
AAMCGGMTNA 
CCCCNGAAYW 
TTTACAMGGC 
TTKGGGGGGG 
KKCCCCCCCM 
GYNAAAKGGT 
AANAMKMGGM 



CATGCCGGGT 
TCTGGCGTTC 
CGACGCTTAA 
NGATAANGAA 
AATTGCS TGT 
TGCGGCTAAC 
CGCAACCTGC 
TGATTMMAAC 
SYTGCCNCCG 
AC CS GAATNN 
TTCKNCNGGG 
NCMTNMTTGM 
GGCYGRTRMC 
NNATNGGGCG 
TSNSCTMANM 
RCGGNCGCNN 



GATGCGACAT 
GCTCAACTGG 
CTTGCGCTTC 
TTCGCTGGTC 
GCATCATCTG 
GAANAAGTTA 
TGTGTNTGAA 
CNAACCCNTN 
TGAAACCCMA 
SAACCTNCCR 
GMNNTYCTCN 



NCGGGCCCCC 
YKCRAAACAA 



GAAAGGGSMS 
GGGRKNNACN 



CGCATCGAGG 
ATTACGGTTC 
CACCGTGCAA 
GCCGGGCACN 
GCAGGCTATG 
TGACATGGCG 
GCGTMTCAAC 
CNATYCTTTG 
CTNCCCCCCC 
TCAAANTAMM 
GGTTYNGGCG 
GWCCCGGGKK 
GGCCCKKMAA 
ACCCCAANRA 
SGMSCCMNSN 
GSCKSCNNGN 
NMKMCAWSMC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
S60 
720 
780 
840 
900 
960 
1020 
1040 



(2) INFORMATION FOR SEQ ID NO: 3 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 



AATTCGGCAC GAGACAANGG CGTGAAATGG 
GACCTGTTGG CCAAGTCGCG GCCGCCGGTT 
AGCGTGAACT CAGGGCGAGA TTTCGGCCGT 
TKGGGAACGG TCAGGGTTCG CAAACCACGA 
ANTCCTGATA CTTKGGTACA TCGTGACCAA 
NGTCGCGTCC CGCGCGGTAA GGTCCANCAC 



GATCCGGCCG AGCTGGGGCC CGTCGTCAGC 60 

CCGGTCTATG GGGCCTAGTT ATCTGCGCCG 12 0 

TTTCTCGCCC TGGCTTCACG TTCGGCGAAG 180 

TCGGGATCGT GCGGTCGGTC CAGGACTGGT 240 

CTGTGGNCAA TATTCGGCGC GCTCCTCGTC 3 00 

TTCCTTTTTC TCGTGCCG 348 



(2) INFORMATION FOR SEQ ID NO: 3 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 14: 



AATTCGGCAC GAGAGACCGG GTCGTTGACC AACGGACGCT TGGGCGCGGG CCCCTTGCGT 6 0 

GGCATCAGCC CTTCTCCTTC TTAGCGCCGT AACGGCTGCG TGCCTGTTTG CGGTTCTTGA 120 

CACCCTGCGT ATC CAGCGAA CCGCGGATGA TCTTGTAGCG CACACCAGGC AGGTCCTTCA 18 0 

CCCGGCCGCC GCGCACCAGC ACCATCGAGT GCTCCTGCAG GTTGTGGCCC TCGCCGGGAA 24 0 

TGTACGCCGT GACCTCGAAC TGACTCGTCA CTTCACGCGG GCAACCTTCC GAAGCGCCGA 30 0 

GTTCGGCTTC TTCGGAGTGG TGGCTCGTGC CG 332 

(2) INFORMATION FOR SEQ ID NO: 315: 



m (i) SEQUENCE CHARACTERISTICS: 
j= (A) LENGTH: 962 base pairs 

fn (B) TYPE: nucleic acid 

Jr (C) STRANDEDNESS: single 

hi (D) TOPOLOGY: linear 

P= (ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



AATTCGGCAC RAGTCGGTCT AGACGGATTC AATGCTCCCG CGAGCACCTC GCCACTGCAC 60 

ACCCTGCAGC AAAATGTGCT CAATGTGGTG AACGAGCCCT TCCAGACGCT CACCGGCCGC 120 

CCGCTGATCG GCAACGGCGC CAACGGGACT CCTGGAACCG GGGCTGACGC GGGGCCGGCG 180 

GGTGGCTGTT CGGCAACGGC GGCAACGGCG GGTCCGGGGC GAACGGAACC AACGGCGGGG 240 

ACGTGGGGAC GCGCCCGGCG GGATTTCTTC GCACCGGSGC ACCGGCGGGG CCGGCGGCGT 3 00 

CGCACAACGG CACCGGCGGG GACGCNGCGC CCGTNGGGCG GCTTCTKGAT GGGCTCCGGC 3 60 

GGTNACGCGG CACGGCGGCG CCCGGCTCAC CGCCNGTTGG GACGCGGGGA CGCGTNACCC 42 0 

CGATCTT CTT CCGCNCCCCG GAAAC CGCGG GGCCGGCCCC ACATTAKACC CGGCGGNACC 48 0 

GCGGMCCCGG CGGAACGGNG GGYNTTTTCC AACGGCGGGG CCGCGGAACC GNMGGSTGTT 540 

CCTTNGGSGA AGGNCCAAKT CCCGKCTANC YYAATCCCCG ANGGKTGAMC CTSATGSNCA 600 

MYTTMAGGAA C YTNC C CANT KTTSGRACCW CRC CNGGAAA ASRAWNKNGT KGGCAAACNA 660 

NNTNCYTTKN NATTKGGNNA AAAANCCCTY CCWCSGRACT NCCCCCCNGM GRGMCNNTNN 72 0 

NTTTYGNCNN CCCGGSNAAM RNTTKATTTC NGGGGGNTCN GGGTKMNNNA AACCCCAAAM 78 0 

MNRNNKCSCA ANGGGKSNGC NKNNMMNSGT TTTYCKNMRA MRNWTYKNKN NTCNGARSRN 840 

NAAMCNNSNK NGKKKNNKAA ARNNTTWKTN KNSCNNNCNN GRRNGVRGGC CKMKGSNMNG 900 

MCWHNAWRNG NNGSNCNCKC NNKMNAAAAA AAS GGVNCKS NSMKNKKKKG NRGGGGGGGG 960 



(2) INFORMATION FOR SEQ ID NO: 316: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

AATTCGGCAC RAGAAGACGC CCGAANGTTT GCGCTGGCTC TACAACTTCA TCAARGCGCA 60 

GGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 12 0 

GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180 

GCAGAAGATG TCGTTCGAGG TGGCCTGGAG GATTTTGCAN GCGACGCCNG TGACCGCGAC 240 

GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCGGCACC GCGTTGACCT CGACCAGCTG 300 

CACCACTCGT GCCGCTCGTG CCG 323 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 
Ms (C) STRANDEDNESS : single 
f=| (D) TOPOLOGY: linear 

S 

f~ (ii) MOLECULE TYPE: Genomic DNA 

i (xi) SEQUENCE DESCRIPTION: SEQ ID NO:317: 

AATTCGCAGT GTGTGTGGCG GCGTCCAGAA GAAGATGATC GCGAACATCG CCAGCGCCGG 6 0 

CCAGGCTATG GTGCCGGTGA TGGCCGACCA GCCGATCATC ACCGGCATAC AGCCGGCCGC 120 

L. CCCACCCCAC ACCACGTTCT GTGACGTGCG TCGCTTGAGC CAAAGCGTGT AGACRAACAC 180 

ATAAAACGCG ACGGTGACCA GGGCCAGCAC CCCCGCCAGC AGGTTCGTGG CGCAC CAT AG 24 0 

LH CCAGAAGAAC GAGATCACCG TCNACGTCAC CCGAGTGCCA ACGCGTTTCG GGTCGGCACC 3 00 

GCTTCCCGCG CCAAGGGCCG GCGCGCGGTT CGCTTCATCA CCTTGTCGAT ATCGGCGTCG 360 

GCNACCAGTT GAGCGTGTTG GCGCCGGCGG CSGCCATCAT CCCGCCGACN ANCGTGTTGA 420 

Q GCATGANCAG CGGATGAATG GCGCCGCGGC TCGTGCCGCT CGTGCCGAAT TCAACTCCGT 480 

IU CNACAACTTG CGGNCGCACT CGAACCCGGG TGAATGAWTG AATTTAAACC GSTSAACANT 540 

AACTACATAA CCCTTGGGGG CTCTTAACCG GTYYTGAANG GGTTTTTTGC TTAAAGGAAG 6 00 

AACYATTTCC GGATANCTGG CSTTNWTARC GAAAAGGCCC CRCCCATNGC CCTCCACAGT 660 

TTSCCCCTGA ATGGSAATGG MNCNCCYKNR CNGGGNCTTT AACRCSGGCG GGNTTTTGKT 720 

MCCCNNCTKA CNTTMMMTGC ARNNCNGGCC SKCCCTTCCK TNTYCCCTCC NTCCCCCNST 780 

TNCNGKTCCC CNNAMNYTNW ACGGGGGGCC YTNGGGKCRM TWTKKTTTGG GCCCCMCCCC 840 

MAAANASAAN GGGGKRNGTY CSTTTGGCNC CCCAMAARGG NYCCCCCCAM YTNRRKMCSY 900 

CNNTNKGGNN CTGTNCKNCG GAARAMAMCC KCCCCGNSTS STTNGTYWAG GNRWKGNSRG 960 

CCSCCCCGGY MNNNAAYAWN WMNATNCNNS STNANMAKKN NNNNNNNSCN WNGNGNNTCN 1020 

SCNSNGGKBC CSCC 1034 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 
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AATT CGGCAC GAGCCCACAT CCGGGGCCGC TCGTTGCATG ACTCGTTCGT CATCGTCGAC 60 

RAGGCACAGT CGCTGGAGCG CAATGTGTTG CTGACCGTGC TGTCCCGGTT GGGGACCGGT 12 0 

TCCCGGGTGG TGTTGACCCA CGACATCGCC CAGCGCGACA ACCTGCGGGT CGGCCGCCAC 180 

GACGGGTCGC CGCGGTGATC GAGAAGCTCA AAGGTCATCC GTTGTTCGCC CACATCACCT 240 

TGCTGCGCAG TGAGCGCTCG CCGATCGCCG CGCTGGTCAC GAGATGCTCG ANGAGATCAC 3 00 

CGGGCCGCGC TGAGTGCGCC TCCCGCGAGC A - 3 31 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.319: 

AATTCGGCAC GAGATCGTCA CCCTGGCGAC CAGTGCACCC AGGCCACGCC AC C AGTTACG 60 

GCTGATGGGC CAGAAGATGG ACCAGGTGCT GCCCATCCCG CCCACCGCAC TGCAGCTGAG 120 

CACCGGGATC GCGGTCCTCA GCTACGGCGA TRAGCTGGTG TTCGGCATCA CCGCTGACTA 180 

TGACGCCGCG TCCGAAATGC AGCAGCTGGT CAACGGTATC GAACTGGGTG TGGCGCGTCT 240 

GGTGGCGCTC ANCGACAATT CCGTGCTGCT GTTTACAAGG ATCGGCSTAA GCGTTCATCC 3 00 

CGCGCACTCC CCANCGCCGC GCGGCSGGGG CGGCCCTCTG TGCCGACCGC CCGAGCGCGT 360 

CACTGACGCC ATCTCCGTCG GCGTTAACCC CGTGAGAAGG TGGGTCGTGC GCAAGTTGGG 42 0 

CCCGGTCACC ATCNATC CGC GCCGCCATGA CGCNGTGCTG TTCCACACCA CNTSNGACNC 48 0 

CCCCCAGGAA CTGGTCCGGC AMTNCAGGAA NTYCGTGTGG GCACCNGCTT CTTCCGKTRT 54 0 

GGCYTAAACT TCCNATS TTN CSGCSGGCCT CTGGCGTTNC GNCCGGGCCG NTCTTNCCAA 600 

ATCGGSMMAA ATCCCCANMC AAACCCCCCG GGTCTTGSGG GCSGGGNGGC GGCCNAWNCC 66 0 

AAACCCCCCC NTTAAANTCT TTGKTNCCNN CNCSGGCNCC NCNAANSCAN CCCTTTKGGC 72 0 

NCTTCCCCCC CCCAWTTTAA CCGAKCGSCN AAYCCCAAGY TMMGKCCYCY KNAAAAAAAA 78 0 

AATTTGSCSG CCCCAANTAA ATTCCCNGGC CCYTTGGGGG CGRANCNYNT TTTMCCSNSS 84 0 

TKGNNNAAMC NGGANCCSGG KAAYTMMTKG NAAYCGCCSN AAMBNTTTTC TAANNCCCCN 900 

YNCCCSGAAA ATTNNAMAAM CMNNKTGSNG GGGGKTTSNC SGKKGRAGGM AAAAAANRSN 960 

SKTTNMCNNN SANMNCNSNN SGGNSNNNNN NNNCNCGYKC CSNAANMCCC CGCGGGGGGG 1020 

CCMMCC 1026 

(2) INFORMATION FOR SEQ ID NO: 32 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

AATTCGGCAC GAGAAGACGC CCGARNGTST GCGCTGGCTC TACAACTTCA TCAARGCGCA 6 0 

NGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 120 

GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180 

GCAGAAGATG TCGTTCGAGG TGGCCTGGAN GATTTTGCAN GCGACGCCNG TNACCGCGAC 240 

GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCSGCACC GCGTTGACGC TCGACCAGCT 3 00 
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GCACCACTCG TGCCGCTCGT GCCG 324 
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGC CAGATAT CCCGGACTCG 60 

GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 12 0 

ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 18 0 

ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240 

ATAGTGGCCT CCAGAGTGGC CGTGC AMTT C CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300 

CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAAC CGCA SGATGGCGAC 36 0 

GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420 

ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480 

CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540 

CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600 

WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660 

AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 720 

AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCSCMNAA MYMTTTYCSG 780 

MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840 

LH NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 900 

fU GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 960 

111 NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010 

fy (2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGC CAGATAT CCCGGACTCG 6 0 

GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 12 0 

ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 18 0 

ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240 

ATAGTGGCCT CCAGAGTGGC CGTGCAMTTC CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300 

CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAACCGCA SGATGGCGAC 360 

GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420 

ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480 

CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540 

CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600 

WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660 
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AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 72 0 

AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCS CMNAA MYMTTTYCSG 780 

MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840 

NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 90 0 

GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 96 0 

NTTTC CNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010 



(2) INFORMATION FOR SEQ ID NO: 323: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



M= (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Q NGNGGGGWNS NTCAYCAYCA YCAC SGGGYW CWATTGCGGC CGCAWCTTGT MAASAGATCT 60 

m CGAAYTCGGC AMGAGGGAMT CKCTMGCNCC GCTGTGCAAN CCAATRAGGC CTRATAATTY 12 0 

J=: CCACTCCACA AAAAACCGTT GTGTGTAYYT SCCGRAAATR AAGGCGCCGG TNTCAACWYC 18 0 

m GCCGGTKTTY CCRATYCCCG TKTTGTAMCT GCCKGGGTSR AAAYCCCCGG TGTTGGAYCC 24 0 

CCGGATTGAA ACTGCCGGKT TGAAACTGCC GKTTTSGCSA TCCGGKWATT GAMSTCRCGG 300 

ATTAAAAAAC CGGKKTTGGN GCTGSNCGTG CCAAATNCGR AYCCRATAYC CCATGGCCTG 36 0 

KYCTYCTCCK YCGGTACCCA AAYCTGGGTA TCCTATACTG GYCCCTAAAK GCAAWYCKGG 42 0 

%\ GCTGYCMMTK TTGCKGGSGT CCNAATTTAS CACCASCGGT TCCTTC CAT A CCNAAACNCG 480 

iT= CKTGGGCWCC AGMCCGRAAA AAAKAATAAT RAKAAKGGTG CATNYCCAAA ACCNCCGCCN 54 0 

J*) CCCNANTNCN ATCCGNTNCC MSCNCCCCCA GCGGTNAAGK TKS GGAAYTT CTMMAACCCC 60 0 

I- CAAANCCCCA TAACNTNCGR GAASAAACCC CTYCNCGGGG GYCNWNCAAA ACASCNTTAT 66 0 

TTGCTKSTTT CGGGMWCCGT GCCGCCNAAA YCCCAAASTA CTTTYTGGGT CCNAGAKAAA 
5 ACCNCGGGCN CCMCCCSNAA NWTATYTCTT KGGCAANCCC CSAAACCTTR TCMNACCNCK 

FlJ ATRMTCCCTT CCCCVSCAAT TGGYCGGRAT NCGSNCCYTY TCAAAKKKSC CAKWWNNGNG 84 0 



720 
780 



900 



GRRNNACCMA ACCCCAAGTY CCMNAAAATN GKCCCCGCTC CNAACACGNK TYYTCCSAAA 

ASCCCWCCCC CCCCCCCRAA AACCCCCCNA RKANTNCCCA AAAACNYNGK GGCCCCCCCC 960 

CAAACMAAAA AMCCCCCSGM RMACSGGGGN NMCCCCGKKK KKTTTTCTTT TKCCMRSCCC 1020 

AAMGCAMWSY KSKTNMAAAA GGAAGRANCN TYCCSANANM TCCCNYWRSW CCGSWGMGNA 1080 

GAASMCCCCC CS 1092 



(2) INFORMATION FOR SEQ ID NO: 324: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

GGGGGGGNNN NATACATCWT CYGTGYACCG GGGMTCTAKT GGCGGGCCGC AATCTNGTCA 
ASAGATCTCT NAMTTCGGGC ACAAAAACTW GACAAASYMT CGNGCNMTCC GTGTCCTNKA 
TCGCAAAACG NGTRACASAC ASACACRTAT GTGTGCCCAC CASCAAYTCK TTGGGACCTC 
GCTRACCGGY TGCCCRNACG CCACGYTGCS CWTCTATCCC RACGCCGGCC ACGGGYGGGG 
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ATATTCCAGG CACCACGCCC AGTTTGGTGG ACAATGCCCT GGCAKTTTCC TCRAANTTCG 300 

TGAAACCGAA TTCNSMTTGA ACCNCCAARG CCCCSNCCNR AACARTTGGG WTCCGCGGTT 360 

CTCCCCACCG KTTTCCGGGG GTNTCGGCAN AANCGCACCC WTGGWTTCTM TCNCCGCACC 420 

GGGCGGACAA NTCGGGTTGC AATTTTGCRA AYCGGGGCCG GGATTCCSCA AACGGGTGCC 480 

GAAACTGTTY YCRAAMACCG GGAKCCGCAA TTTCCGGGCR ANAAATTTCN YCNCACCACT 540 

GCTTRTACTT CCCCGACCGT AACMANTTTC ATCGTCNTNN CCTCTGCCCT TGGGGCAGGG 600 

CKAAAYACCG CMTTKGGTTT CGCAACCTGC GGCCCAANTC CCNAMCCRCA CTTT CNATTT 660 

GGNTCGAATT SCCCCCCGGT RANAACCSCC NTGGCCNNYT CGGASSAAAA NGGGCCCTNT 720 

KGGCNSCCCC AGTAANACCC TACCNNAYTS CAWTCTTTGC CAAASTTKGG ACGAANSKTG 780 

GGNTTCCGGK ATTTYYTTGS GGNCNCCCTN TATNGGSNTN GGGCCKCYNC NCSTKTGKCA 840 

NASSKAYCCS NGNKGGGGGT ACCCCCCTMG GGGGGTTTTT NSSGCCCCCC AWAYGNKSTG 900 

GCCCCCNNGG GGAAKAATWT MWWTMCNSGG GGGAAWTTTT NTSTGGAMCS SGGACYCCCR 960 

GGGGGKTTTT TCCCCCNCSA NNAWANGGGG GGGGGANAYT NTGNSGNGGG KWNTTTATTT 1020 

YTYYCYCCTM TKACMSGGGG GTTTKKAKNG GGGGGAGAAA ANAAAAAAAA RAKGGYKNTT 108 0 

TSKNCACNCT GKWNWNWANR NAGAGKTCCT CKCKCCNCSG SNTTTCTTTT MGNSGSYGGG 1140 

GNNGNNNAAA ACHKS RMMAC KCSYTYCCCG CGYCTCCTCC NCNGGGGYGS NGSCGNSTYN 12 00 

GNNKGRKWTA TNTMGNC GTN SCCTCCNCCC GCKNKNTGTC TMTCNMYGSG C 12 51 

(2) INFORMATION FOR SEQ ID NO: 325: 

jzj (i) SEQUENCE CHARACTERISTICS: 

i= J; (A) LENGTH: 10 99 base pairs 

^ (B) TYPE: nucleic acid 

™ (C) STRANDEDNESS : single 

+ (D) TOPOLOGY: linear 

m 

V (ii) MOLECULE TYPE: Genomic DNA 

O 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

HI AAYTCGGCAC MGAGTATCAC CAAKCTGYGT GGCCCAGCAA AGTGGAGCTA TTACTACCTG 6 0 

O TATGTGATCC TCRACATCTY CTCCCGCTAC KTGGTCGGGT GGATGGTGGC CTCGCKTGAK 12 0 

rlj TCRAAGGTCT TGGCCRAACG GCTGATCGCG CAAACCCTTG CGCCCAGCAC ATCAKCGCCG 18 0 

AACAGCTGAC CTGCMCGCCG AC CGGGGG YC GNCAATAACT CCAAACCGGT GGCMCTGCTG 240 

CTGGCCNACY CCGTGTCCCA ANTCGAACTC ASCCSGCNMA CCAKMAACKA NAACCGTTGT 300 

CTGAAGCCCA GTTCAAAAAC CTCAAGTWCC GGCCCRACTT CCCGAAACGG TNCGAGTCKA 360 

TCRSAGGSGG CCGGGTGCMC TGCAACCGGT TCTTCGGNTG GTRCAMCCCN AAAMCAAGCA 42 0 

TTCCGGGMTC CGMMTGCCCA CGCCGCCAAS TTTMCTACGG GCSGSCCNAT CAAATTCGCC 480 

GGGAACSGSN CCMCCKTCNK GGAMACGCCC TWCCAAAACC CYCGAACGGK ATCCTTCKGY 540 

NAACNCCCGA RCNCCCKSKT TCCGGGCTTC NMSGCGAATA CCCKNSCMNT CCGAATCCAA 60 0 

TTCCCMKYGG CTTTTYYYCC CCCCGGCCCC AAAYNGGGYC CCTASSNMKC KNCCAMNANT 66 0 

CCNWATCTGG NGGTCCCNAN KYYGGCGTTC NMAATSAMNA NMNRGGGTYT TSCYACCMMN 72 0 

AACCGKNNKG KCCCCMKCTK MANAAAKATT RATCAMKWNG GGNKCKCNCN NAAMACCSCN 78 0 - 

CNCYNCWYTC TMYCSSKWGC GCSMYNANCA SNGGGGAGGW GGSGRMKMCT CTMTCTCNCT 84 0 

MGCGCCKNTN TYCKSGAKAT ACASMNKTCC GCGCNGCGCN MAAMANRAKA CTAKCCGYGN 900 

CCSNSTMTYN CTSNNMKMNN TCCWMWNATC NTYYGKKCNN KCTMKATNWC CSCTSKCNCK 960 

MRAMTCKTYG SNMTCCTCCA TCNCTCKKSC SNMSKNTCKC KSCNCCNCWN CNKCNMKCWN 1020 

GGNSTCRCCY TCTMNNNTCS AGCKCGSKNC WACNCACACK NGWCTYTTCC WKNNMKCNKM 108 0 

TCKCKCACRG MTMTCWCCS 1099 

(2) INFORMATION FOR SEQ ID NO: 32 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

GNGNTATACA TCWCTGTGYA CCSAGGATCW ANTGCGGCCG MAAKCTWSTM CASAGATCTC 60 

AAAYTCTGCA MGAGCGGCAC AKAKYSTCGT CCMRACCCGG CAYACWCCWG CNCGCCCCWT 120 

CTTRGACCGG GGCKATASMC ACCGTTGGCC CCGGCNCGCA CCTACACCAC CCACGCCGCC 180 

AGCGCCCCCW TRAMCAAACC ACCCCGCKTT TACCGCCCGC GCCGCCGGGG CCACCACCAG 240 

CCCCACCGGC ACCACCGGCG CCGCCGTTGC CAAAACAGGC CCGCKTTTGC CACCRA 2 96 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

NGNGS GNKMY ATCATCWTTC TGCACCSNGG MTCWATTGCG GCCGCAATCT TSTMNASAGA 6 0 

TCTCGAAYTC GGCAMGARCA TCTGCGCGGN GAATGTCCAA AWGTCWKTAA CGGCMATCGG 12 0 

TTTGC CGYCA ACCACKCTRT SCAKATGCGG GCCAMWTYCA AACCRATTAT TTGGGYCGAG 18 0 

AAAATTTMCG CKTGTRASCA ACCTGCAGCG GGTCAASCAA CAGCCT CTRA AC CGTAAAT Y 240 

CKTAGGTNKT YCCGGCAACA AS C YCRATAA TSCGGCCCGC AMCCACAAAA CCTGANTNGT 3 00 

TNTTCNCRAA NCCGGTYCCC GRAGGGGTSA ACTGCS GTAR GCTTNTCWYC NCCTTRACAT 360 

TAAACCCCCC CGGNTCWTCG CCGCGCCCAA ATYCYTGCCC WTKGCNACCA YCCCANCCTG 420 

CSGTATGGTS RAANCASTSG GCRAACGGTM MCCSTACCKC TGGCTGATYC KTCGGNTCCS 480 

SNAATTCGGG GATTTACGGS CAMGGTTAAY CCAGGYCCCC TNTGCYTCKY CNACAACCSG 540 

ATCMWCNCCG TAC CTKTTAA AATTCTTTGT GGTGGAACCC AWYCKAAAAA NMTNTYCCCN 600 

TCCAMMGGGG CYCGGAAKKT CNACNTGGKT NACCCCTNCC YTTGAASTTT TCYTGNCCCC 660 

GGCCCKAAAS ANACCSGAKC CCCGGAAYCS WTAGGCYTCN TGCCCCSTTA AATTKGNCYC 720 

AATCCKCCAA CGCTCCCCGG GGTCSSCCMT TAAAMTTCCC CCCKSCASNG GAATYCYKSG 780 

GCWGTMATTW CCNCCCNTTT CYYGKNAAAC SCCCCCWKGN GSCTYCCCCN SNTTSSGCCS 840 

GGTTSGAMYC AAAAWTNGGG MMCNRAGNCG SGNAMCCSCN GKKGGGSATW TKAAYYCYGG 900 

GGGGGTCNYC CCCCRCSNAA AAGYGTKGGC KCCSSSCCYC CCMARTTTYT CNGGMRCMAM 960 

ACCANGGGNG CTCCCGTNCW WGGCTCCCSN SNSMAMAAAN NKCKCCKGGS CKGARRNMNA 1020 

MCTCSNGNGG WTCCCKNKTC NSCNSGNCGS YGGNSASWCC YNYCNCCACA ANC 1073 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
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CGCCCCGTTC TTMMMTTCAY TCATTCACCG GGMTCTAGTG CGGCCGCAAK CTTGTCKACA 60 

GATCTCGAAY TCGGCAMGAS ACAATSTCGG GTKGGGCAAT GTCNGGTGGG GCAACTTTGG 12 0 

GCTCGGRAAT YCGGGGTTAA CGCCGGGTCT RATGGGTSTG GGTAATATCG GGTTTGGTAA 180 

TGCCGGCAGC TACAATTTCG GTTTGGCAAA ATATGGGTGT GGGCAATATN GGGTYCGCTA 24 0 

ACACCGSCAS TGGRAATTYC GGTATTSGGT NACCGGTRAY AAYCTGACCG GGTNCGGTGG 300 

TTYCAATACC GGTAACGGGA ATGTSGGTTS YYYACYCCGS GSAACGGNWW YTTNGKTCCT 360 

TMMCNCTSSM CCKSAAMTSM KMGGTSTYCT MTYCNNGGAS TAMTYNMCCC CCGWAYCKSC 42 0 

WAYCCCTCGT CATYCCMCMC SGSGYCCTCA MNCCACCYTG NGYYCCCTCC MKMTCYCAYT 480 

CMNTCCGGTW CCTNTMMNCC CSCNCRYCTC AMCNCTKSGK CACCNATMYC CSACKCHTCT 540 

MCYMCS CAKN MTTCCCCTCN CCTYTNNCCA MCMCSCTCTM TCMAACTCKC CCGGYCKCNC 600 

MYCTCTCKCC AYNMAACCKK TYCYWCNWYC YMYCKCKCAG WYKNMCTCCW ACTCTMYNTT 660 

TCTCTCNKCC CMKACCKNTT CTCWCSCCCC CCACAKAYMC YAWCMTMTCC MCTCKACSCC 720 

CYYCNNYCCM NMCWCMTCWC TWNAKCANCN TTCTTCTCTC MMYMTMACKC WCNNTCNCCK 780 

SGACCYTCTC ACTKMKCCKM TCTC CTTMCK CCYMWCNTCC MKYNCCCTCC NMTCMTCKYT 840 

CCTCNCNMRY CYYYAKCAKC NMCTCCCCAN KMC AKCTKC T CCCCCAKMKS ACNCKCCCWC 900 

CCTCCTATCC WCTCTCWCTY ATCTCKCTCW CNYCMYMKMC ACNCKCYAYT CNACTMNMWN 960 

CCANCNCTCT CTNYCTCWCK ACGTYCKCCK CTMCKCNYMC NRWCTYRCCT CKKCCNCCRN 1020 

CKNMCMKCTM CTCTCCWMKM TCCCWCCCAT CTMMKSTCTC WCNCMTCCCT CNKCCYNYNT 1080 

KCYTYCCMYG CTTCKNTCMT MCCWCCYATC TCTMKCCTCT CWCACYMCAC WMTTACWNCC 1140 

ACTCTCTRCW CKCCKCMCCR MTCTCB 1166 

(2) INFORMATION FOR SEQ ID NO: 32 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

NGNGGNNNNT CWTACATCWN TCTNCACCSG NGMTCWATTG CGCGCCGCAW NCTTGTMNAS 60 

AGAATCTCNN AAYTCGGCAC ANATGT CTTT TSTMTAKTGT GGCGGGGNGC CACGCCKTAT 12 0 

GTGYGCCTGG GYTRACCCAA CCCCGCGGCS CGGGCCRACC AGGCGGGGRA TSCAGGCCGC 180 

GGCGGCCGCG GCGGYTATAT RAAGCGCCGY TTTTKTRATA ACGGTSCCGC CGCCGGGTRA 240 

TTACGGGCAA AAYCGGKKTT TTGGGTRTAT AACGCTAATT GCAACCAWTT TTTYCGGGT C 300 

AAAAACYCGG CGWGCANATC NCGGGYCNCT RAGGCGCATT YMCGCCAAAA WTNTGGGCGC 360 

AAAACCCCKT TSYTATTTTN TGGGCTATSC GGYTGCTTCG GCAAACGCTY CCCGGGTTAA 420 

TCCCKTCCGC GGCGCCGCCN AAAAACCACC AATYCCGYTG GGGGTGKYCC CMCAGGCSGT 480 

TGCTYCGNGY CACCTGGCCA AAYYCCCAWT AKATTGGGTG SCYCKTSCGG TTSYTGGGCY 540 

CAATTACCCC CNCGGGNAAA GRRAAAANAA ATCNTCCNTT TGCTCGGYCA YCTTTMTTGG 600 

SAAAAGGGGC ATGGCS CGGT TYYTTTACCT CAAYCCCCNA NCANTWACCT YTCCSCCCGG 660 

GGGGNCANAA CGSTTNGCTC CGSGGNAKCC TKGTMCCCGN ATCNAAAGGC CNGAATTTGG 720 

TYYSSTYCNA ATTWTWKKKY CCCCWCNTTG YAAAAAKCCA AAASAKCCCK YCNCAMMYKT 780 

NGGGGTYSSG GCCKNYCTTK SNMTTAAACC CYCCCCAAAA YYNSGGGKKT TCCGCYNSAT 840 

KCCACCNCCK GNGGGGGGNA SAAAAAAAAY TTTYCCSAAA ATCCCACCYY TCYKTKS TRY 900 

AMACCCCCTT TYYMKKAYTC CKYS CNATTC SGMTTCWAAA TYCCGYGGCT TNTTCCCCCK 960 

CSGGNGCCCC AAWTTTGKTT YNCNANTTYC CCCNAAMNCM AWTMGGGGKS KCCATTCTGG 1020 

SCYTMAANTA AAANAANGGG NKTTTYYCTY MANAAACACN GTGKCNCNCN CNAAMAAASN 1080 

AKMARAKAGN KKKMTKNNSA AANCCNCCCC CTSTYTNYTT NKTNMNCKCC CYGGKKNKGM 1140 

SWSWYNTTCT NCCCRCCCCC YNYNKTGANA AAMMNCYCCS GGSTMCRNAN ASNMNTTTCK 1200 

STSTNGMGCC KMB AS NAN AN MCAMWKWYCC 123 0 
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(2) INFORMATION FOR SEQ ID NO: 33 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

NGNGGGKNNA TMAYCWTCTC ACSSGGTCTA TGCGGCGCAW CTMGTMAASA GATCTCNAAY 60 

TCGGCAMNAN GCATMTCMMC CATATATAAC CATTGCGTCS GYWTGCAWCT CRAAWCTGTC 120 

CTTCSKGCCG TTKTACRAAG GTGGMWTGYT CWTYCCTRAA SCCCTCRATC TCKTKTATYC 180 

CTKGGGCTYC ACTTTAACSG RATKSCTGCC TTKTAYCATT RATGCAAWTA WTGGYCRAWT 240 

jM. KTTGCAGGCC RACGGCWYCT TTTYCCGCRA GRACAATNGA TTGGAWYCGC TYCGCRAGGC 300 

Q CCGGCACCAR ACCGGGCNCC AAAGGYCCGC GCAAWTSCCT GGKTCAAAAA TGGTGCAAAC 36 0 

Q AAAMCNATCC CCGGYTTRAC CGCAGYTAMC ACAAKAAAAT TCCCWTGGCC GCACCAWNNT 420 

TTYCRATCWY CWYCCCCACC TTRAACTTGK YTGCSGTATT GCCTKCCTGC CTCRACAGCM 48 0 

jz YCNCCCKTCA AACCTGCGGT GACTCCAACT GGTCTGGYCG AASGGGGGYT CAMCGGACAA 54 0 

AACCCCRANN TCGCCAAATT TTCNCCCCCC CYCGGGAAAN GKTGATMTTC TCSNAACCSA 600 

CMGGGNNYTW NAACCCTGAA CSSSGSNKGA MYNSCCSGGA ANTTTTCCCT TYNGGGCGRN 660 

ill AAANCCTTTT AAGGTACCCC KGGNGGGGKG CCCYYTTGGG AAAACAACCC CKATTGGKTT 720 

TGGAAATNTT TKCNCCCCCA TTCNSGGGGG GGGCCCCAMC CCMMCTTTTN TCMS CNMTYY 78 0 

JU YCYYGGGAAT TNYTCGCCSG GAAYYCGGSM CCKGYCCTAA NCCCCMNWGG GKYSTGSNAR 840 

GGRATMAWWT TYS TTTYYMC CCGGCNNCCC CCCKAKMCNT KGNTGAACMA AAAKCSGGGG 900 

If GSCNMYMWYY YCNNNGNRTT TNRGGSSNMT TYMAAAMMAN GGGGKYWTYY CKCCNGSCNN 960 

Vi GKTYSGGGST TTTCCNTTTS GGGSSATYKG MACCCCKTMT AYC CGGGGGT NTKTKYCCCC 1020 

yi SC 1022 

o 

^ (2) INFORMATION FOR SEQ ID NO: 331: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

NNCGNNKNTA TAMAYCWYCT NCACCSGGGA TCWATTGCGG CCGCAATCTT STMAASAGAT 60 

CTCKAAYTCG GCAMGANCCG CAWCTATTTG KGTGRASCGC ACCAGCGRGA CCTCGCSGKT 120 

CKTTYCTTGC AGRGAGGCCK TGGGTGGCRC CGGTGGCAAT GCCAACCGCC CCCCAAAACN 180 

CCGCAAATMY CRAAAAACAA CCCSGGGGTA GKTCCSGGCC GCCAAATMAA TAACCGTKTT 240 

AACKCAGGCN ACGGCCAACC GGYCCCGCCC AACCAAGCNA CCTCCCCSCC NATAGGYCCG 3 00 

GTGGGGGCTG CCKTATYKCC AASTCGTCAY CTCNACGGGM CGGYCCMCWT TCCGCCTCAT 360 

CCGTCTCTCC TTMMATTTTC CRTCCACYKG GCGGGGAACY TTTTTNYCNC CCTTGSCMAN 420 

CACCNAAGGY CNAAAATTNC CCMTGCCKYG SNNCAAAYGR GATTGGGGTY CGKKTTTTNT 480 

TCNMCCMAAC CCCCNTTTNA CGCCCCMATC CCYTWATACC CCCWWMCMNS ANGKTTGNSA 540 

AAKTNNCCCC AAATRCCAAA MTTCTTCGCC NTTTMTWMCY YYCCTTTCCC CMCCCWNAAA 600 

GGSCCRCCYY TCGGGAANTY TCCCCNCAAA AWTCAMWCCM TTTCCCNCCA AGAAWTTCSG 66 0 
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SACTCCTTTN TTCNGGGNAM ATANATYYTT YCKTNGGGSK TTCCGMTCNC AMMAATNTCC 720 

RGGGKAAMCC AGKNTNNTCC YYYYCCCCAA NNTYCCYKGG RMCYNNYYCY TTAAANRASR 780 

SAACCCKSGG GKCYNCNCSS TARCCCCCAM KAAAATTTCC CCCSSKTTTC TYYNNKKMRW 84 0 

GCCCCCSAAM ACTMTWAYTT TCCCKCGNNN TTTSYCCKCS KCAMWMWMTG KKNCTTTTTT 900 

YCSCMATAMA CTTNGGKC CT NTCNYGSGCG CMAAANAAGG CGCGSTTCTN TTCWMAMACA 96 0 

YNTSGNMMMA SAAKAKWATA AWNNTRKKYK TKNNCCCNCC CKCKCTTSNN TNKCCMCSKS 102 0 

GGGKNWNKKR GWCTCCWCNC CKCCCNCKNK CCKWATMCCC CCCCSKCCGM NCMMNTTTKT 1080 



(2) INFORMATION FOR SEQ ID NO: 33 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



M= (ii) MOLECULE TYPE: Genomic DNA 

Q 

a (xi) SEQUENCE DESCRIPTION: SEQ ID NO:332: 

m 

=£ GGGGNNKYAT MCAYCWTCTS YACSGGGMNC TATTGCGGCC GCAWYTNGTM GASAGATCTC 60 

01 GAAYTCGGCA MGAAAAAAGW GATGTGCTGG ACCTTMCCGC GCGGGACGCR ACCRACAAAG 120 

£ RAASCGCGCC ANAATATTGG C C AC AKTTGG TCACATATTT ACCCAATTMT AYCAGGGAYT 180 

ill MCCATTCCKG GGACCRACCG CACAATCCCR ATSKTGGTTT GCRAACCCTR ACCGTCCCCA 240 

MYTYCGCCRA STTGAACCAG GGCRAAAAAA CGGCCRAAWY CTCGCCCTGA NTCCCGCTCS 3 00 

^ GCGCNAATAA CTAGGCCCAT TKAACGGAAC CGGNGGCCSC NANTTGGCCA AC AGGTC CTR 3 60 

m ACAAAGGGGC CCCASYYCGG CCGGWTCCCW TTYCACNCCC TNKTCTCKTG CCGAATYCGG 42 0 

J?1 WTCCRATNYC CCWTGGGCCT TKTCKYCKYC KYCGGTNCCA AWTCTNGGTA TNCTATRGKG 480 

?!! TCCCCTAAAT SCANATCTGG GCKYCCATTT NCTGG SNTTC NATTTAMMAN SRRCGGTTCT 54 0 

H TTCWTTCCRA AACCGSNTGG GCCCNNMCCA AAAAATGATN ATAATAATGK YGSCTTTCAA 6 00 

ACCCCGCCCC CCCATTCRWT CSGTTCCANC CCCCNGNGGT TAAGKTGGGA ATTTYTNAMC 660 

fy YCNARGCCCT NATTTSGGNA AAAACCYCYC GGGYCTCAAA CMNYTTTTTT GSKSSNTCGG 720 

GCTCRTTCSC CAAAACCCAA ATTNTYNYGG GGYCCKTNAA ACMCGGYCRC RCCGGAAATT 780 

TTTYTGGTTC AACCCCAACC TTTTCAASCC NTTTTYTYYT TRCCSSCSMN TNGSSGGGNT 840 

KSSCCNTTCY RARKKCCNMN GGGGGWYCYN CCCCRMNTTT CTTTTTTTTT CCGTNNMAAM 900 

NGKTTCTTCA AASMCCCCCC SCCCCCNSAA ACCCCCTNAR GTTTTYCMMA AANNWYNNGN 960 

KNCCCCCCCC MMNAAAAAAY YCSCCCGNRN ACSMSNGGGA MCCCCCGGSN NTTRKTTTTT 102 0 

TNCMSGYCCC CSRMASYYTT TKAMAMANRR GAMNSMTTTY TNNRGNWNK 1069 



(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

NGNGGGGKWK MATACATCWT TCTTCACGSG GGATCWATTG CGGGCCGCAW TCTNGTMCAA 
SAGATCTCGA TYTCGGGCAM NACCCACCWC TCCRAAAAAA ACCCRAAWCT CGGGSKCTYC 
GARAAGTGTT GCCCGCKTTR AATTTAACAA ATTCAGTGTC ANAGTGTCAC GGCKTTACWT 
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YCCCGGCAAA GGGGCCACAA CCTGCAGRGA S CACYCRATG GKTGYTGKTS CNCGGGCGGG 240 

CCGGKTNAAG GGACCTGCCT GGGTKTGCSC TMCAAANATC WYCCGCGGGT YCGCTGGRAT 300 

MCNCAGGGGT GTCAAAAAAC CGCAAACAGG CACSCCANCC NTTTACGGGS CTTAAAANGA 360 

AAAAGGGCTG ATGCCCCCAA GGGGGCCCGC NCCCAACCTT CCGTTGGTCA ACAACCCGGT 420 

CTCTCKTGCC RAATCCGRWT CCRATNYCNC CWTGGCCTTK TCKYCTYCTY CGGTACCCAA 480 

ATCTGGGTAT CCTATASTGT CCCCTAAWTT CCAAATCTGG GCTGTCCATT TSCTTGGCNT 540 

TCCAAATTTA CCANCAACGG TTTCTTNCAT NCCAAAAACC GNTKGGCKCC NRACCCRAAA 600 

AAATGAATAA TAATAANNGG KCNNTTYCNA ACCNCCCCCC CCCNATTCCA TYSNGTTCCA 660 

NMNCCCCCAG NGGKTAGGTK GGGAAANYYC TCMAC CYYCA ANCCCTWARS TTTTNGRAAT 720 

KAAACCCTYC YCNGGGTCWW TYMAAAAAMA NTTATTTGGN NGNTTTCGGG MWNCKRKNST 780 

SCCAAAATCC MAAATANTTT YYTGGTYCNA TWAAAAAMCG YGNCCMNCCC GGAAAAWTTT 840 

TTNTGKTTSA ACCCCAAAAC YTTTTCMNAA NCSSKTTTTY CYTTCCCCCC AMNWTGGGYS 900 

GGGNATKGYG SCYTNTCTTA TKTKYTYMTW CMGGGGGGNN MKMTCMMCCC CCMTTTYYCY 960 

NYWRTTTTTN KCCCCKTNMR NNRAANNGGN YTCSYNANAA AAGCNCCCCC SCCKNCCCNA 1020 

AAAAWCCCCN NNNARAKTNT TTMKANNRMN SCKCNKNGKY YCCCCCCCWC YNMNNAAAAA 1080 

AATMYCCNCC RASANMCASM NMGGRGNRSC CCCCCCCSTT NNNNTMTTNT TTTTTTCSRA 1140 

GAGCKCCSCG MNNANMKNCK CTTTTTKCNC NNGNNGNGNN GGNGMNCKCC CCNAGAAMWK 1200 

CTKSTCCCKS 1210 

(2) INFORMATION FOR SEQ ID NO: 3 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 334: 

NGSSSNGNNA TMCATCWYCT GYACSGGGMT CWATTGCGGC CGCAACTNGT MAASAGATCT 60 

CGAAYTCGGC AAKANACACC ACCGCCGTGT MTATACACCG CAAATGTTCT GTKTGCCAAA 120 

ACCGAGACGC GCCGGCCGCG GGGYTCCAAC GCKTTACYTR ACCCGCCAGY TCAGTGTTRA 180 

AACCGGTGYT RAGGGCCGCA CCCAACWTAA ACGCTTTAKC CAAGRAWYTG GKTGGCCCGC 240 

AGCCACCTGY TGTGGYTGCC CTCWYCGGTG GTAGCGCCGG TTANCGCCGG TTGCGCGYTC 300 

AMCASCSCGC CGGTRATCCC AKCNWTCCCC CGGCCMRACC CACCGGGCAC TTTGRACGGT 360 

GCCGCCAATT CAAAYCKYCT GRWTCCTTCM AAACACCACR AAGGCCACCM CCMSCACCNA 420 

ATMGGGRACT TTAAGGCCCA GGCAAAACCT NTRAKCNCCT CCCGGGCRAA GGTCCSGCAA 480 

SCRATCCMAA AAAAKCKNAT TTCCCCCAGC AKCAACCCAA MMCGSTTTGC TGCTTCCGGA 540 

TTCGAAMCCA ATTMCWGGKT NCNWGGGAAA AACASCNNCC NWTAKCCMGG CCCMCGGGCA 600 

ATTTCS GRAA SAACCCCTNY CCCGGGTTTT YCCTGCTCMG GCCCAANACC CCCGGGAATC 660 

AAAAASGGTC GGNCAAANGG GCMAAACCCS SACCCMACTT WTTCCRCTTN GGGGGGSCWN 720 

CCKNGTTTAA AWKSCCTCYY CTSCCCAAAY TCGGKCMAAA NNGRKTTGGK TTNGGCNACC 780 

NTTTCCGGKC CCGGGKGKGK WGKYCTMNMA CSTTTNTTTT SCCCCYKAAA NYSCCCCCCC 840 

CGGSSCCCCG CCCGGGGGGA NNTTTTTAMA GKKTYCCCCT CCCCAMAAAA ANACCCCNYC 900 

CCSGGSCCCT TTKRWAAAMN KCTSCCCCNG GNNGGGGKCM GGKTTATTMT NNNCCSCCCC 960 

TCCGCGSAAA AAATAKMTTT SYCCCCCCNC CTCCKNCKNR GKAMSMSCGC TCCCYCTCNC 1020 

GCNKNTWAAN ARSNCCKKNN CCNCYKCCGS NSNGKCNWCD NCCSTSSNCT NKGCNCKNCN 1080 

KAAANAAYNC NGSMSTSSMN CNKCC 1105 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 936 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335: 

NGSNSNKNNN TAMAYCWYYC TSCACSNGGA ACWANTGCGG CCRMAWCTNS TMKASAGATC SO 

TMGAAYTCGG CAAGAGCGGC AAGAGTGTGT GCATCTGGTC ANAGTSTMMA CRCGGTGCCG 120 

CSGGTGKGTR GASCACMCAT NTGCGRACAC CAAACCCKTC GCGGGYCACC GGCKTCGCCT 18 0 

GCAAAWYCCT CCAGGCCACC TCRAACAAYW YCTYC TGCAA CGCARGCCGT TYCGCGGCCG 240 

RATCCTGGKT CASYYCGCCK TGCGGTGCCC AAGKTACTGG CSCAYCAAAA CCGCTCCGGG 3 00 

RAACRAACKT AAWTYTGCCG AATTTCNTTC CCCTGCGCCT TGATAAATTT NTNAAGCCAC 360 

CGCAAMCCTY CGGGCKTCTC CTCKTGCCRA ATYCGRWTCC RATAYCGCCA TGGCCTNKTC 420 

KYCTYCKYCS GTACCCAAAT CTTGGGTATC CTATANTKYC CCWAAANRCA AWTCTGGGCK 48 0 

KTCCATKTSC TGGSKTCCRA ATTTAMMACA NCGGTTTCTT TCWTACCAAA AACCSNTGGG 540 

CCCCRACCRA AAAAKGATAA TAATAAKGTG CWWWCAAAAC CCCGCCCCCC RRTTCAAYCG 60 0 

GTCCARCACC CCANGNGGTN AGGTNGGAAT TYTMAACCCC CAGCCCATAA SNTTNSGNAA 660 

AAACCCCCCN GGGYMYCAAA AMMCTTTTTG GGGMTTCSGS CCATKGYKCC AAAACCAAAA 720 

TMTTTCYGGT CRWAAAAACC GGCCCNCCCG NAAATTTTTT GKCAACCCCA AACCTTTMAM 780 

CCNNNTTCYY YCCCNSACAA TNGGSGGNKN NGSSCNTTYT TWTTTYYNNA GGGGGGRRWC 840 

SNCCCCNAAN YYCCNAANKG NKCCCGSNMA AAAGAGANTT YCMKAAAAAC CCCCNCNCCC 900 

NAAAYACCCC MAAAKWTTCM AAASMSCNNG YCCCCC 936 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1042 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336: 

NNNGNKNNNY ATMMAYTCWY YCTSCACCSG GGNNWCWATT GCGGCCRMAW KCTTGTMAAS 60 

AGATCTMNAA YTCGGCACAG ASSSGCACAG ASCCGCGGCG CTATYCMYCC GYTGCTCATG 120 

CTCAACACGC TCKTCGGCGW GRATAATGGC NCGCCGCCGG CGCCAACACG YTCAAYTGCT 18 0 

TCGCCAACGC CATATNTCAA CAAGGTRATA AAASCAAAAC CGCSCGCCGY GCCCTTGGGC 24 0 

S CGGRAAS CG GTGCCAACCC RAAACNCKTT GGGCACYCGG KTSRACTTTA AASGGTAATC 3 00 

TCKTCCTCCT GGGCTATGGT GCGCCACAAA CCTSYTGGCG WGGGTCTGGC CCTGGGYCAC 360 

CGYCRCNTTT TATNTNTCCK YCTACACNCT TKGGTYCAAC CAACCCACTT CACMAAATTG 420 

TTTTGGGKTG GGGS SGCCGG YTGTNNCCGK TAATAATCSG NTGKTC SGCC MYCACCGGWA 480 

CCATANCCTG GCCGGCSCTG GCAAATTTCC SAAATCATYT CCTTCTGRAC CCCCACAMRC 540 

CTNSAAATCC GRATCAATNC CCCNKGGCTT NTCYCTCTCN GTRCCCAATY TGGTTTCTAT 600 

RKTNCCCYAA TSCAATTGGS TTYCCRTTSC YGSTTCCAAN TTNACAAMAS GGTTTYTCMT 660 

ACCAAAACCC NTGGSCCNNA CMNAAAAKNA RAAAANAKGG KCTTTYAAAC CCCCCCCTAT 720 

TCAWYCGGTN CMRNWCCCCG NGKAAGGKGN GAAAYTTHRA CCCAANCCMT ARSTTSGNAK 780 

AAACCCYYCG GGGTSMCAAA MKNTWTTSSC CTTCGGMCTT YCCAAATMSA AAATYYTCKK 84 0 

KRMNAAAAMC YGNCCCCSAA ANATTTTTGT NAAMCCCKMA YYTRTTWMCC WTTTTCCYCC 900 

CCMCNNSNSG GNTNCC CTTY TYATTTCYMM MCRNNSGACN CCCCMNTYTT TWTTCKCWCN 960 

MMARGSNNYT RGRMMNMNCC CCNCCCCNAK MTCCNCAAAK NTTTNAACNN NNKYCKCCCC 1020 

CCCMWMNKNC CCCCMNCMTT TM 1042 
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(2) INFORMATION FOR SEQ ID NO: 33 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

NNSGSGMKKK ATAMATCWCT CTSYACCSNG GMTCWATTGC GGCCGMAWTC TNGTMAASAG 60 

ATCTCGAAYT CGGCAAANAK ACGCMAYGTC AAGTGTRAYY CGGTCACATA TCMTCGCGNG 120 

TCAACMCCAA AGCCGNGTCA CCGYCTCCCT GGGGCGC CAC CCCCATCGGT RATGCAACYT 180 

CGCGCGCCAC CGYCAAAAGG KTCWTTRAGG CGCTAAAGGT CAMCAATTCC TRAGGTYMCN 240 

M" CAC CGTTNTT TGGCCCGCCC RAWTYCTRAC CCGCAATWTC GGTAATCGGR AATTTGGGCW 3 00 

O YCGGCTTGGG CAATAAGKTN TTGGGCAACG GCGGRWTCYC NCTGGCCGRA ATTCCCNCAT 36 0 

O TCCKTTAACG GKTGRACCGT TTYCCCGGYT GCCGTAAYTG YTYCNTGGGC GCCYTCGGCC 42 0 

m CRNAGCASYY CRCTAACGGY CMCCAGGCAA TACCKTTGGC TTTRAACCAC CGGRATNAAY 4 80 

=JE TGKTACCCAC YTCAASSGTS CTGRANTTRK TNTCNTGRAA AANMCCACCN AACCCGGNTT 540 

08 RATCTGCTTC MTCANCWTTT SCCGGGTTCT GCCGTTTTGR AAYCTTNATC CMTYCAAAAG 600 

4S GTTTAMTTTC CCAANRAATT CGGYTTGCCA CCTTGGCCGS GGCTGGTTTM CGMWC CTTRR 66 0 

Ly AMATCCNCCS GCGGGSAAAN AMTTSGGNTT SGSCCGGTCC CCCGNAATAT YCNTGGNCCT 72 0 

GNAAATTGSS GGGATCCCCN GSGNAYCCGG CCWTKGGGGK TNCCCAGTTG GWACAATTYC 780 

Q WKCCGTTCCA AACCCGGGNC CGGGGGGTGG GSCCCNTTTT CCTMYNNAAA AAGKGTTTGN 840 

NYYTTTTCCG CNRAANTTCA CCSKCNKTNT GGNCCNAACY YYYCAANTTC CANACCTTTA 900 

m AASAAANCYK YGKTYYCCCC TTTTMCCSGS SANCCCCCCM NMSSKNCGGG AAAAAAAGNK 96 0 

TYNGCCTTAN CNSNKTKTTT TNKTYCCCCC NMWNNSNMCY NCBKKCNKRY NGNSNMNCCT 102 0 

tl MKYSKCNNNN SNNNNNKCGN GSNCSGMKYM CMNNCNGMYK NGNKSNNCCC MSC 1073 

iU (2) INFORMATION FOR SEQ ID NO: 33 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

GNSNGNKNTN TMCAYCWYCT SCACSGGGTC TATTGCGGCC GCAATYTNGT CKASAGATCT 6 0 

CGATYTCGGC AMNANAARTG TCGTCGTCAA TTTCAGKKTG GTCKTCAAAY GGGCCAGGCC 120 

GNGAC CRACA CCCTGNGTCA CCCAAAANAC CAACAGCWTC AAATWTCAAG GCCRAGGCSC 180 

TRTCAATYCC CRAS CAKTT A ACCGTKTCCW TCRAAGGTGC CRAACCAGGC ACCCAGYTCA 240 

CCGCCSGGCA AWTCGCGCTG CCGGCCGGTN TCAGCCTGAT TYCTGACCCT RWTCTGTSGG 3 00 

TGGYCAMCNT GGTGAAGGCC CWWCCGCCNA AGAACTGGAG GGCRAATTCC CAGGANCCNA 360 

GRAACCCNAG GAACCCGCGG TAKAANC CGG CRAAACCRAG GCCGYTGGCN ATTCCNATTA 420 

NAMS GGTTTG CRACNTGGCC RAACCGTTTY CTTGGTCGGC CTCGGCAACC CTGGAC CANT 480 

TACCCCKTNC CCGGNMCMAC CYCGGGTNCT TGKYCCCAAT NTGCYCCCGC GNRANTNGGC 540 

CNAATTCCAG GGCNCCANCT TTCCGGCCCN AATTCCCYTG GTTAATCACC GGGCNCNCCT 600 

GGTTTTGGGC AACCCCNCYS CTTMTTTAAA CATTCCGSCC CAAATGGGNC STTGGSAAAT 660 
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TCTNTYCGGT GGGGCSGGCR ANMYTTCTCT YCCCNAASAN CTTAMYC CAN TTCGSSNTCC 720 

CGGKCAAAWS NGGGGGGGNA AAGGGCCCCC CGGNTSCKCC GGGGKKGCCC CYGGKTTCAA 780 

AANTTTCSGG GKTSTMSCGG NVTCSCCCCC CSGCCAAGRA CCGNGGTTTT TTTTTGAACC 840 

KCMANTCSSA AMCCGCCSSC CCCMAAAGGS GCCTNAAWGR RAYTTNKSCC CNNAAACSGG 900 

CCCCCAKYTY SGGKTTCNNC CNCCS GKKGT CCMTSTTTMM MRCCCTTTGN GNKTTTTTAN 960 

MGSCCTTNNC CACCCCCYCK GGGKCSMNNA GAAKTMYWKC CNGGGGNNAN RSCCCCCCNN 1020 

GSGKGGGGKG MGAGYS CCKT CTKGCGNCNN YKNTTTCCCC C 1061 



(2) INFORMATION FOR SEQ ID NO: 33 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 986 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:339: 

GNNGNNNKWN ATMCAYCWYY CTSCACCSGG GMTCWATTGC GGCCGCAWKY TNGTMAAS AG 60 

ATCTMGAAYT CGGCACANAG CGGCACAGAG TGTGTGCATC TGTGTCANAG CTGTCAACGC 120 

GGTGCCGCSG GTGGTRASCA CMCATTGCGR AACACCAAAC CCGTCCGCGG GYCACCGGCK 180 

TCGCCTGCAA AAYCCTCCAG GCCACCYCRA AACAAYWYCT CCTGCAACSC ARSCCGTTYC 240 

GCGGCCGRAT CCTGGKYCAS YTCGCCKTGC GGTGCGCCAA GGTACTGGCS CWYCRANACC 30 0 

GCTYCGGGRA ACCNAACGTA AATCTTGCCN AATTTGCNTT CCCCCTSCCC TTRATNAATT 36 0 

TGTTAAACCA CGCAAACCTY CGGGCKTCTC CTCKTGCCRA WTCCGRWTCC RATNYCGCCA 42 0 

TGGCCTNKTC KYCTYCKYCS GTMCCCAAAT CTTGGTATCC TATATTGTCC CTAAATGCAA 480 

ATCTKGGCTG TCCATNTGCT GGCGTTCAAA TTWAMANCAG NGGTTTCTTY CTTCCNAAAC 540 

CCSTTGGCCC CAAACCNAAA AATGATNATA ATAATGGTGC TNTCAAACCC CGCNCCCATY 600 

CNATCSGKCC AMMCCCCRGN GGKTANKKGG GNAATTCTMM AACCCCAAGC CATAASNTTG 660 

SGANAAACCY NCNCMGGYCA CCAAAACANY NTTNTTGGNY SSNTTCGGMN YCATGGCTNN 720 

CMAAAACCCA AATACTNYYG GGYCCAATAA AAMMMSGGYC SAMCCGGAAA WTTTTYTTGN 780 

KYNAAACCNA AAKCCTTTTT CNAACCCDAN WNTYCCTNCC RCRCMANTGG CNSGGARTKT 840 

SSSCTTNCCA ATGKYCCMAA AGNGGGRANA CCARCCCCAA TTCCTNNNTN KNKNCCCNST 900 

TRNAAAAGGG GKNTYNCMAA AASCNCCNCC NCNCTCCCAA AAKAMCCCCN AAAGAKNTCN 960 

NAANASKYSN NNNSCCCCCC CCMMMN 986 



(2) INFORMATION FOR SEQ ID NO: 340: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340: 



NGNGGGNKRN ATMMAYCWCT SATYYACCSN GGMNMWATTG CGGCCRMAWT CTNGTMKASA 60 

GATCTMGAAA YTCGGCAAAG AGYATKCTCG GGGGCCAGAT TTNTGGCCCG CAACCGCCGC 120 

ACTTTGCAYW TCAACAKTCC SGGTGCCCCA AAAAAWTCWT ACCCCCATMC TYCKTGCASM 18 0 

ASYTGCGCCC RATTRAACAC CCGGCCGGCW TGCTGCGCCA GGTATTYCAS CAGYTCAAAY 24 0 

YCTTTKTAGK TAAAATCCAG CSGGCGGCCA CNCAGCCGGG CGGTKTAGGT GCCTYCRTCA 300 
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ATMACCAGCY CGCCCAGGGY CACCTTGCCC AAAAYCTCCT GGGTCAGCCA AATTYCCGCS 360 

CCGGCCAACM ACCANCCGCA TYCTGGCNTC AATCYCACCG GGCCCGGTGY TAAAMMANMA 42 0 

GRATCTCKTC MANCCCCCAN TCAGCSYTNA CNGCMACAGC CCGCCTTCTT CAMACCGCCA 48 0 

RTACCGGGWT CAACCGGCCS GTCAAACTCA ACAGGCGGNC AGGCCTCCCC CGGANSAAAG 54 0 

GTCTTACSCC NNYAANAAAA MAAGNTCTGT TTTCCCCCTC CASAASNAAA AANCCCCSGC 600 

CGGGCCTTCN NMMGGGTTTG GGGMANANAA AARCNCCGGN GGAACGNATC CGAAAMCTCC 660 

CAAGTCNCMT TWAWAACYCN NNAACCCCCC ANTTTTGGGA AAGGNTCCCC NTTMYCCCCC 720 

TTTTASGKTS GGGMMYYCTY TAAAAAAATT CCCCAAAAAG CCCCGGGAAG GGTCMAMCTG 780 

GGNAAATTTC CAAMCCNWGK TTNTTYNGGT TMCGGGGGRA AATTYCNCTC CCYYNNNGGG 840 

CSSGSNNNAT TAYGGMSNMT TTTNNAAWTM NSGKKTSAMM YNNKCCMNNN SNNMSMANNK 900 

TNAMCKCCCN CCTCNGNGKY CSCYNCCCSG GNAGNGGRAS MKCCNANMAA AYASGNTTNK 960 

CGGAAMMCNN AATKGNNNSC CCGGASMCMN NNNMAAATMT CNCNKCNSNN AANRGMRACN 1020 

CCCNSNSGMN RRGAARMTNY YCCCCCGSKM GKGNKAAAAW GKYCCCCCCM AAAG 1074 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

NGNGNCNKNT MTACATCWTT CTGCACCSGG GNTCWANTGC GGCCGCAWKY TTGTCGASAG 60 

ATCTCGAAYT CGGCAMGAGG ACWCTCGCRA CGCCCCCACA NACTCTGGCG TGTGTACCCC 120 

ATTGNGCGCK TCACGCGCCC AYTGANCCAK TNCACTGGGG TGCCGTYCGC CKTGCGCGGC 180 

GGCCTCACGG CKCTSCWTCT RAAGGCWTGG CGCACCGCAT TCGGTTTTCT RAACGCTGGG 240 

AAAWTGGCCA GCCGTCTGGC TCATGGGNTC TACGCAACGC CNGCCCCCAA CRCTTTCTTA 3 00 

AATCCGGYCC NTCCTGANCS CTTTGAAYCC CGGGGSAAGA ACTGGTTGCS CNCGAYCTGC 3 60 

TCGAACTTRK TCNAAATCCC GCANAKTGTT TCNTAMGYCC CNCCGGAAGG NGAACCTACT 420 

TTCNGGWANG TCGGCNKCCG GCGCTTATCA STCCTGATCA ACGGGGAACT GGYKNNSTTG 480 

KGGGAAAAAG RRCCTCAATG MTYGGTCCKC GCTGCGKANC CGCSCCCTGK GYCGCNAATG 540 

GAAGGCSMAG GGTTAANGCC MTTYCNYCCR RSCCGTSTGA SGKWTTYCGG MGGANKAMNN 600 

NNKMAMWTTK TCRGNGGCCW ATSTSCCGGG CKSTTAKAGA ANACTYCCKW WCCGTNTYSC 660 

SAAAGNTKCS GCGMGTTTTS SCCKMGANGN YCTGATTTSA GGGGGKYKCC CCCGGGGTYC 720 

CGAAWKWRKY CCYAGGGGGM GNYCSAGCSC CGMNNATNAG AGNAAGGKTT RYGSTS KNCC 780 

TYTNKGGACC WSCNNCWSAK ANAACNNKKT TGCSCCNTMS AGNKTNKGRT YCCNKTSTTC 840 

TAAGAGGAGC TATKMKCGCC CKTGGANGMM GAGWGMGCGC KYCCCSNKRT TCNTNGWAAA 900 

TATKSAGMGG TKCCGMAGMK CCSCGTTTKT TKTGANAAMN MSMRKNKKTG CGMGYTCTSC 960 

GGGNTTTGTA GAGTAKTCGS CSCSSMWGAC WCSGMCMGNG AGKNKTNNTS YANTGARCGY 102 0 

MNNSKTMKMT MSCSCGCGNA GGAGNGCCCC CSANGMSTGY NKGGNMSSNG ARAKGATGGS 108 0 

GGCCNCGMNN MGMGGANMGA SANNGMGGMR GGGGGKTGKC TCKCSCCGNS CSANGRAGAA 1140 

GKTCNGSCGC CGMGGKYGKT KTKTKNKTGG YSTCMSSMMM NAGAAAAGAG AGGGC 1195 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

CCATCTGATC GTTGGCAACC AGCATCGCAG TGGGAACGAT GCCCTCATTC AGCATTTGCA 60 

TGGTTTGTTG AAAACCGGAC ATGGCACTCC AGTCGCCTTC CCGTTCCGCT ATCGGCTGAA 120 

TTTGATTGCG AGTGAGATAT TTATGC CAGC CAGCCAGACG CAGACGCGCC GAGACAGAAC 180 

TTAATGGGCC CGCTAACAGC GCGATTTGCT GGTGACCCAA TGCGACCAGA TGCTCCACGC 24 0 

CCAGTCGCGT ACCGTCTTCA TGGGAGAAAA TAATACTGTT GATGGGTGTC TGGTCAGAGA 300 

CATCAAGAAA TAACGCCGGA AC ATTAGTG C AGGCAGCTTC CACAGCAATG GCATCCTGGT 360 

CATCCAGCGG ATAGTTAATG ATCAGCCCAC TGACGCGTTG CGCGAGAAGA TTGTGCACCG 420 

CCGCTTTACA GGCTTCGACG CCGCTTCGTT CTACCATCGA CACCACCACG CTGGCACCCA 480 

GTTGATCGGC GCGAGATTTA ATCGCCGCGA CAATTTGCGA CGGCGCGTGC AGGGCCAGAC 540 

TGGAGGTGGC AACGCCAATC AGCAACGACT GTTTGCCCGC CAGTTGTTGT GCCACGCGGT 600 

TGGGAATGTA ATTCAGCTCC GCCATCGCCG CTTCCACTTT TTCC CGCGTT TTCGCAGAAA 660 

CGTGGCTGGC CTGGTTCACC ACGCGGGAAA CGGTCTGATA AGAGACACCG GCATACTCTG 720 

CGACATCGTA TAACGTTACT GGTTTCACAT TCACCACCCT GAATTGACTC TCTTCCGGGC 780 

GCTATCATGC CATACCGCGA AAGGTTTTGC GCCATTCGAT GGTGTCCGGG ATCTCGACGC 840 

TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC 900 

ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG 960 

GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA 1020 

TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG 1080 

ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC 1140 

GACTCACTAT AGGGGAATTG TGAGCGGATA ACAATTCCCC TCTAGAAATA ATTTTGTTTA 1200 

ACTTTAAGAA GGAGATATAC ATATGGGCCA TCATCATCAT CATCACGTGA TCGACATCAT 1260 

CGGGACCAGC CCCACATCCT GGGAACAGGC GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA 1320 

TAGCGTCGAT GACATCCGCG TCGCTCGGGT CATTGAGCAG GACATGGCCG TGGACAGCGC 1380 

CGGCAAGATC ACCTACCGCA TCAAGCTCGA AGTGTCGTTC AAGATGAGGC CGGCGCAACC 1440 

GAGGGGCTCG AAACCACCGA GCGGTTCGCC TGAAACGGGC GCCGGCGCCG GTACTGTCGC 1500 

GACTACCCCC GCGTCGTCGC CGGTGACGTT GGCGGAGACC GGTAGCACGC TGCTCTACCC 1560 

GCTGTTCAAC CTGTGGGGTC CGGCCTTTCA CGAGAGGTAT CCGAACGTCA CGATCACCGC 1620 

TCAGGGCACC GGTTCTGGTG CCGGGATCGC GCAGGCCGCC GCCGGGACGG TCAACATTGG 1680 

GGCCTCCGAC GCCTATCTGT CGGAAGGTGA TATGGCCGCG CACAAGGGGC TGATGAACAT 1740 

CGCGCTAGCC ATCTCCGCTC AGCAGGTCAA CTACAACCTG CCCGGAGTGA GCGAGCACCT 180 0 

CAAGCTGAAC GGAAAAGTCC TGGCGGCCAT GTAC CAGGGC ACCATCAAAA CCTGGGACGA 186 0 

CCCGCAGATC GCTGCGCTCA ACCCCGGCGT GAACCTGCCC GGCACCGCGG TAGTTCCGCT 192 0 

GCACCGCTCC GACGGGTCCG GTGACACCTT CTTGTTCACC CAGTAC CTGT CCAAGCAAGA 1980 

TCCCGAGGGC TGGGGCAAGT CGCCCGGCTT CGGCACCACC GTCGACTTCC CGGCGGTGCC 2040 

GGGTGCGCTG GGTGAGAACG GCAACGGCGG CATGGTGAC C GGTTGCGCCG AGACAC CGGG 2100 

CTGCGTGGCC TATATCGGCA TCAGCTTCCT CGACCAGGCC AGTCAACGGG GACTCGGCGA 2160 

GGCCCAACTA GGCAATAGCT CTGGCAATTT CTTGTTGCCC GACGCGCAAA GCATTCAGGC 2220 

CGCGGCGGCT GGCTTCGCAT CGAAAACCCC GGCGAAC C AG GCGATTTCGA TGATCGACGG 2280 

GCCCGCCCCG GACGGCTACC C GAT CAT CAA CTACGAGTAC GCCATCGTCA ACAACCGGCA 2340 

AAAGGACGCC GCCACCGCGC AGACCTTGCA GGCATTTCTG CACTGGGCGA TCACCGACGG 2400 

CAACAAGGCC TCGTTCCTCG ACCAGGTTCA TTTCCAGCCG CTGCCGCCCG CGGTGGTGAA 2460 

GTTGTCTGAC GCGTTGATCG CGACGATTTC CAGCGCTGAG ATGAAGACCG ATGCCGCTAC 2520 

CCTCGCGCAG GAGGCAGGTA ATTTCGAGCG GATCTCCGGC GACCTGAAAA CCCAGATCGA 2580 

CCAGGTGGAG TCGACGGCAG GTTCGTTGCA GGGCCAGTGG CGCGGCGCGG CGGGGACGGC 2640 

CGCCCAGGCC GCGGTGGTGC GCTTC CAAGA AGCAGCCAAT AAGCAGAAGC AGGAACTCGA 2700 

CGAGATCTCG ACGAATATTC GTCAGGCCGG CGTCCAATAC TCGAGGGCCG ACGAGGAGCA 2760 

GCAGCAGGCG CTGTCCTCGC AAATGGGCTT TGGATTCAGC TTCGCGCTGC CTGCTGGCTG 2820 

GGTGGAGTCT GACGCCGCCC ACTTCGACTA CGGTTCAGCA CTCCTCAGCA AAACCACCGG 2880 

GGACCCGCCA TTTCCCGGAC AGCCGCCGCC GGTGGCCAAT GACACCCGTA TCGTGCTCGG 294 0 

CCGGCTAGAC CAAAAGCTTT ACGCCAGCGC CGAAGCCACC GACTC CAAGG CCGCGGCCCG 3000 

GTTGGGCTCG GACATGGGTG AGTTCTATAT GCCCTACCCG GGCACCCGGA TCAACCAGGA 3060 
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AACCGTCTCG 


CTYGACGCCA 


ACGGGGTGTC 


TGGAAGCGCG 


TCGTATTACG 


AAGTCAAGTT 


3120 


CAGCGATCCG 


AGTAAGCCGA 


ACGGCCAGAT 


CTGGACGGGC 


GTAATCGGCT 


CGCCCGCGGC 


3180 


GAACGCACCG 


GACGCCGGGC 


CCCCTCAGCG 


CTGGTTTGTG 


GTATGGCTCG 


GGACCGCCAA 


3240 


CAACCCGGTG 


GACAAGGGCG 


CGGCCAAGGC 


GCTGGCCGAA 


TCGATCCGGC 


CTTTGGTCGC 


3300 


CCCGCCGCCG 


GCGCCGGCCG 


GGGAAGTCGC 


TCCTACCCCG 


ACGACACCGA 


CACCGCAGCG 


3360 


GACCTTACCG 


GCCTGAGAAT 


TCTGCAGATA 


TCCATCACAC TGGCGGCCGC 


TCGAGCACCA 


3420 


CCACCACCAC 


CACTGAGATC 


CGGCTGCTAA 


CAAAGCCCGA 


AAGGAAGCTG 


AGTTGGCTGC 


3480 


TGCCACCGCT 


GAGCAATAAC 


TAGCATAACC 


CCTTGGGGCC 


TCTAAACGGG 


TCTTGAGGGG 


3540 


TTTTTTGCTG 


AAAGGAGGAA 


CTATATCCGG 


AT 






3572 



(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

Val Gin Phe Gin Ser Gly Gly Asp Asn Ser Pro Ala Val Tyr Xaa Xaa 

15 10 15 

Asp Gly Xaa Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10- amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Thr Thr Val Pro Xaa Val Thr Glu Ala Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO : 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 
Thr Thr Pro Ser Xaa Val Ala Phe Ala Arg 
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15 10 

(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:346: 

Asp Ala Gly Lys Xaa Ala Gly Xaa Asp Val Xaa Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 347: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:347: 

Thr Xaa Glu Glu Xaa Gin Glu Ser Phe Asn Ser Ala Ala Pro Gly Asn 

15 10 15 

Xaa Lys 



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
CTAGTTAGTA CTCAGTCGCA GACCGTG 

(2) INFORMATION FOR SEQ ID NO: 34 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

GCAGTGACGA ATTCACTTCG ACTCC " 25 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

CATATGGGCC ATCATCATCA TCATCACGTG ATCGACATCA TCGGGACCAG CCCCACATCC 60 

TGGGAACAGG CGGCGGCGGA GGCGGTCCAG CGGGCGCGGG ATAGCGTCGA TGACATCCGC 12 0 

GTCGCTCGGG TCATTGAGCA GGACATGGCC GTGGACAGCG CCGGCAAGAT CACCTACCGC 180 

ATCAAGCTCG AAGTGTCGTT CAAGATGAGG CCGGCGCAAC CGAGGGGCTC GAAACCACCG 240 

AGCGGTTCGC CTGAAACGGG CGCCGGCGCC GGTACTGTCG CGACTACCCC CGCGTCGTCG 3 00 

CCGGTGACGT TGGCGGAGAC CGGTAGCACG CTGCTCTACC CGCTGTTCAA CCTGTGGGGT 360 

CCGGCCTTTC ACGAGAGGTA TCCGAACGTC ACGATCACCG CTCAGGGCAC CGGTTCTGGT 420 

GCCGGGATCG CGCAGGCCGC CGCCGGGACG GTCAACATTG GGGCCTCCGA CGCCTATCTG 48 0 

TCGGAAGGTG ATATGGCCGC GCACAAGGGG CTGATGAACA TCGCGCTAGC CATCTCCGCT 540 

CAGCAGGTCA ACTACAACCT GCCCGGAGTG AGCGAGC AC C TCAAGCTGAA CGGAAAAGTC 600 

CTGGCGGCCA TGTACCAGGG CACCATCAAA ACCTGGGACG AC CCGCAGAT CGCTGCGCTC 660 

AACCCCGGCG TGAACCTGCC CGGCACCGCG GTAGTTCCGC TGCACCGCTC CGACGGGTCC 720 

GGTGACACCT TCTTGTTCAC CCAGTACCTG TCCAAGCAAG ATCCCGAGGG CTGGGGCAAG 780 

TCGCCCGGCT TCGGCACCAC CGTCGACTTC CCGGCGGTGC CGGGTGCGCT GGGTGAGAAC 840 

GGCAACGGCG GCATGGTGAC CGGTTGCGCC GAGAC AC CGG GCTGCGTGGC CTATATCGGC 900 

ATCAGCTTCC TCGACCAGGC CAGTCAACGG GGACTCGGCG AGGCCCAACT AGGCAATAGC 96 0 

TCTGGCAATT TCTTGTTGCC CGACGCGCAA AGCATTCAGG CCGCGGCGGC TGGCTTCGCA 1020 

TCGAAAACCC CGGCGAACCA GGCGATTTCG ATGATCGACG GGCCCGCCCC GGACGGCTAC 1080 

CCGATCATCA ACTACGAGTA CGCCATCGTC AACAACCGGC AAAAGGACGC CGCCACCGCG 1140 

CAGACCTTGC AGGCATTTCT GCACTGGGCG ATCACCGACG GCAACAAGGC CTCGTTCCTC 1200 

GAC CAGGTTC ATTTCCAGCC GCTGCCGCCC GCGGTGGTGA AGTTGTCTGA CGCGTTGATC 1260 

GCGACGATTT CCAGCGCTGA GATGAAGACC GATGCCGCTA CCCTCGCGCA GGAGGCAGGT 132 0 

AATTTCGAGC GGATCTCCGG CGACCTGAAA ACCCAGATCG ACCAGGTGGA GTCGACGGCA 138 0 

GGTTCGTTGC AGGGCCAGTG GCGCGGCGCG GCGGGGACGG CCGCCCAGGC CGCGGTGGTG 144 0 

CGCTTCCAAG AAGCAGCCAA TAAGCAGAAG CAGGAACTCG ACGAGATCTC GACGAATATT 1500 

CGTCAGGCCG GCGTCCAATA CTCGAGGGCC GACGAGGAGC AGCAG CAGGC GCTGTCCTCG 156 0 

CAAATGGGCT TTGTGCCCAC AACGGCCGCC TCGCCGCCGT CGACCGCTGC AGCGCCACCC 1620 

GCACCGGCGA CACCTGTTGC CCCCCCACCA CCGGCCGCCG CCAACACGCC GAATGCCCAG 168 0 

CCGGGCGATC CCAACGCAGC ACCTCCGCCG GCCGACCCGA ACGCACCGCC GCCACCTGTC 174 0 

ATTGCCCCAA ACGCACCCCA ACCTGTCCGG ATCGACAACC CGGTTGGAGG ATTCAGCTTC 180 0 

GCGCTGCCTG CTGGCTGGGT GGAGT CTGAC GCCGCCCACT TCGACTACGG TTCAGCACTC 1860 

CTCAGCAAAA CCACCGGGGA CCCGCCATTT CCCGGACAGC CGCCGCCGGT GGCCAATGAC 192 0 

AC CCGTATCG TGCTCGGCCG GCTAGACCAA AAGCTTTACG CCAGCGCCGA AGCCACCGAC 198 0 

TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC ATGGGTGAGT TCTATATGCC CTACCCGGGC 204 0 

ACCCGGATCA ACCAGGAAAC CGTCTCGCTC GACGCCAACG GGGTGTCTGG AAGCGCGTCG 2100 

TATTACGAAG TCAAGTTCAG CGATCCGAGT AAGCCGAACG GCCAGATCTG GACGGGCGTA 2160 
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ATCGGCTCGC CCGCGGCGAA CGCACCGGAC GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA 2220 

TGGCTCGGGA CCGCCAACAA CCCGGTGGAC AAGGGCGCGG CCAAGGCGCT GGCCGAATCG 2280 

ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG CCGGCACCGG CTCCTGCAGA GCCCGCTCCG 2340 

GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT ACCCCGACGA CACCGACACC GCAGCGGACC 2400 

TTACCGGCCT GA 2412 

(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 02 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

Met Gly His His His His His His Val He Asp He He Gly Thr Ser 

15 10 15 

Pro Thr Ser Trp Glu Gin Ala Ala Ala Glu Ala Val Gin Arg Ala Arg 

20 25 30 

Asp Ser Val Asp Asp He Arg Val Ala Arg Val He Glu Gin Asp Met 

35 40 45 

Ala Val Asp Ser Ala Gly Lys He Thr Tyr Arg He Lys Leu Glu Val 

50 55 60 

Ser Phe Lys Met Arg Pro Ala Gin Pro Arg Gly Ser Lys Pro Pro Ser 
65 70 75 80 

Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro 

85 90 95 

Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr 

100 105 110 

Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn 

115 120 125 

Val Thr He Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly He Ala Gin 

130 135 140 

Ala Ala Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser 
145 150 155 160 

Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala 

165 170 175 

He Ser Ala Gin Gin Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His 

180 185 190 

Leu Lys Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gin Gly Thr He 

195 200 205 

Lys Thr Trp Asp Asp Pro Gin He Ala Ala Leu Asn Pro Gly Val Asn 

210 215 220 

Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly 
225 230 235 240 

Asp Thr Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro Glu Gly 

245 250 255 

Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val 

260 265 270 

Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys 

275 280 285 

Ala Glu Thr Pro Gly Cys Val Ala Tyr He Gly He Ser Phe Leu Asp 
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290 295 300 

Gin Ala Ser Gin Arg Gly Leu Gly Glu Ala Gin Leu Gly Asn Ser Ser 
305 310 315 320 

Gly Asn Phe Leu Leu Pro Asp Ala Gin Ser lie Gin Ala Ala Ala Ala 
325 330 335 

Gly Phe Ala Ser Lys Thr Pro Ala Asn Gin Ala lie Ser Met lie Asp 

340 345 350 

Gly Pro Ala Pro Asp Gly Tyr Pro lie lie Asn Tyr Glu Tyr Ala lie 

355 360 365 

Val Asn Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala 

370 375 380 

Phe Leu His Trp Ala lie Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp 
385 390 395 400 

Gin Val His Phe Gin Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp 

405 410 415 . 

Ala Leu lie Ala Thr lie Ser Ser Ala Glu Met Lys Thr Asp Ala Ala 

420 425 430 

Thr Leu Ala Gin Glu Ala Gly Asn Phe Glu Arg lie Ser Gly Asp Leu 

435 440 445 

Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala Gly Ser Leu Gin Gly 

450 455 460 

Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin Ala Ala Val Val Arg 
465 470 475 480 

Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu Leu Asp Glu He Ser 

485 490 495 

Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg Ala Asp Glu Glu 

500 505 510 

Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe Val Pro Thr Thr Ala 

515 520 525 

Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro 

530 535 540 

Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gin Pro 
545 550 555 560 

Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro 

565 570 575 

Pro Pro Val He Ala Pro Asn Ala Pro Gin Pro Val Arg He Asp Asn 

580 585 590 

Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser 

595 600 605 

Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr 

610 615 620 

Gly Asp Pro Pro Phe Pro Gly Gin Pro Pro Pro Val Ala Asn Asp Thr 
625 630 635 640 

Arg He Val Leu Gly Arg Leu Asp Gin Lys Leu Tyr Ala Ser Ala Glu 

645 650 655 

Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu 

660 665 670 

Phe Tyr Met Pro Tyr Pro Gly Thr Arg He Asn Gin Glu Thr Val Ser 

675 680 685 

Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys 

690 695 700 

Phe Ser Asp Pro Ser Lys Pro Asn Gly Gin He Trp Thr Gly Val He 
705 710 715 720 

Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gin Arg Trp 
725 730 735 



273 



Phe Val Val Trp Leu Gly Thr Ala 
740 

Ala Lys Ala Leu Ala Glu Ser lie 
755 760 
Ala Pro Ala Pro Ala Pro Ala Glu 

770 775 
Gly Glu Val Ala Pro Thr Pro Thr 
785 790 
Pro Ala 



Asn Asn Pro Val Asp Lys Gly Ala 
745 750 
Arg Pro Leu Val Ala Pro Pro Pro 
765 

Pro Ala Pro Ala Pro Ala Pro Ala 
780 

Thr Pro Thr Pro Gin Arg Thr Leu 
795 800 



(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 
GGATCCAAAC CACCGAGCGG TTCGCCTGAA ACGG 34 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 53: 
CGCTGCGAAT TCACCTCCGG AGGAAATCGT CGCGATC 3 7 

(2) INFORMATION FOR SEQ ID NO: 3 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

CATATGGGCC ATCATCATCA TCATCACGGA TCCAAACCAC CGAGCGGTTC GCCTGAAACG 60 

GGCGCCGGCG CCGGTACTGT CGCGACTACC CCCGCGTCGT CGCCGGTGAC GTTGGCGGAG 120 

ACCGGTAGCA CGCTGCTCTA CCCGCTGTTC AACCTGTGGG GTCCGGCCTT TCACGAGAGG 180 

TATCCGAACG TCACGATCAC CGCTCAGGGC ACCGGTTCTG GTGCCGGGAT CGCGCAGGCC 240 
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GCCGCCGGGA CGGTCAACAT TGGGGCCTCC GACGCCTATC TGTCGGAAGG TGATATGGCC 300 

GCGCACAAGG GGCTGATGAA CATCGCGCTA GCCATCTCCG CTCAGCAGGT CAACTACAAC 360 

CTGCCCGGAG TGAGCGAGCA CCTCAAGCTG AACGGAAAAG TCCTGGCGGC CATGTACCAG 420 

GGCACCATCA AAACCTGGGA CGACCCGCAG ATCGCTGCGC TCAACCCCGG CGTGAACCTG 480 

CCCGGCACCG CGGTAGTTCC GCTGCACCGC TCCGACGGGT CCGGTGACAC CTTCTTGTTC 540 

ACCCAGTACC TGTCCAAGCA AGATCCCGAG GGCTGGGGCA AGTCGCCGGG CTTCGGCACC 600 

ACCGTCGACT TCCCGGCGGT GCCGGGTGCG CTGGGTGAGA ACGGCAACGG CGGCATGGTG 660 

ACCGGTTGCG CCGAGACACC GGGCTGCGTG GCCTATATCG GCATCAGCTT CCTCGACCAG 720 

GCCAGTCAAC GGGGACTCGG CGAGGCCCAA C TAGGCAATA GCTCTGGCAA TTTCTTGTTG 780 

CCCGACGCGC AAAGCATTCA GGCCGCGGCG GCTGGCTTCG CATCGAAAAC CCCGGCGAAC 840 

CAGGCGATTT CGATGATCGA CGGGCCCGCC CCGGACGGCT ACCCGATCAT CAACTACGAG 900 

TACGCCATCG TCAACAACCG GCAAAAGGAC GCCGCCACCG CGCAGAC CTT GCAGGCATTT 960 

CTGCACTGGG CGATCACCGA CGGCAACAAG GCCTCGTTCC TCGACCAGGT TCATTTCCAG 1020 

CCGCTGCCGC CCGCGGTGGT GAAGTTGTCT GACGCGTTGA TCGCGACGAT TTCCTCCGGA 1080 

GGTGGCAGTG GGGGAGGCTC AGGTGGAGGT TCTGGCGGGA GCGTGCCCAC AACGGCCGCC 114 0 

TCGCCGCCGT CGACCGCTGC AGCGCCACCC GCAC CGGCGA CACCTGTTGC CCCCCCACCA 12 00 

CCGGCCGCCG CCAACACGCC GAATGCCCAG CCGGGCGATC CCAACGCAGC ACCTCCGCCG 1260 

GCCGACCCGA ACGCACCGCC GCCACCTGTC ATTGCCCCAA ACGCACCCCA ACCTGTCCGG 132 0 

ATCGACAACC CGGTTGGAGG ATTCAGCTTC GCGCTGCCTG CTGGCTGGGT GGAGTCTGAC 138 0 

GCCGCCCACT TCGACTACGG TTCAGCACTC CTCAGCAAAA CCACCGGGGA CCCGCCATTT 1440 

CCCGGACAGC CGCCGCCGGT GGCCAATGAC ACCCGTATCG TGCTCGGCCG GCTAGACCAA 15 00 

AAGCTTTACG CCAGCGCCGA AGCCACCGAC TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC 1560 

ATGGGTGAGT TCTATATGCC CTACCCGGGC ACCCGGATCA ACCAGGAAAC CGTCTCGCTC 1620 

GACGCCAACG GGGTGTCTGG AAGCGCGTCG TATTACGAAG TCAAGTTCAG CGATC CGAGT 16 80 

AAGCCGAACG GCCAGATCTG GACGGGCGTA ATCGGCTCGC CCGCGGCGAA CGCACCGGAC 1740 

GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA TGGCTCGGGA CCGCCAACAA CCCGGTGGAC 18 00 

AAGGGCGCGG CCAAGGCGCT GGCCGAATCG ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG 1860 

CCGGCACCGG CTCCTGCAGA GCCCGCTCCG GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT 1920 

ACCCCGACGA CACCGACACC GCAGCGGACC TTACCGGCCT GA 1962 

(2) INFORMATION FOR SEQ ID NO: 3 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

Met Gly His His His His His His Gly Ser Lys Pro Pro Ser Gly Ser 

15 10 15 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser 

20 25 30 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 

35 40 45 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 

50 55 60 

He Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly He Ala Gin Ala Ala 

65 70 75 80 

Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 

85 90 95 

Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala He Ser 



275 



100 105 110 



Ala Gin 


Gin Val 


Asn Tyr Asn Leu 


Pro Gly Val Ser Glu His 


Leu Lys 




115 




12 0 


125 




Leu Asn Gly Lys 


Val 


Leu Ala Ala 


Met Tyr Gin Gly Thr He 


Lys Thr 


130 






135 


140 




Trp Asp 


Asp Pro 


Gin 


He Ala Ala 


Leu Asn Pro Gly Val Asn 


Leu Pro 


145 






150 


155 


160 


Gly Thr Ala Val 


Val 


Pro Leu His 


Arg Ser Asp Gly Ser Gly 


Asp Thr 






165 




170 


175 


Phe Leu 


Phe Thr 


Gin 


Tyr Leu Ser 


Lys Gin Asp Pro Glu Gly 


Trp Gly 




180 






185 190 




Lys Ser 


Pro Gly Phe Gly Thr Thr 


Val Asp Phe Pro Ala Val 


Pro Gly 




195 




200 


205 




Ala Leu Gly Glu Asn Gly Asn Gly 


Gly Met Val Thr Gly Cys 


Ala Glu 


210 






215 


220 




Thr Pro Gly Cys 


Val 


Ala Tyr He 


Gly He Ser Phe Leu Asp 


Gin Ala 


225 






230 


235 


240 


Ser Gin 


Arg Gly 


Leu 


Gly Glu Ala 


Gin Leu Gly Asn Ser Ser 


Gly Asn 






245 




250 


255 


Phe Leu 


Leu Pro 


Asp Ala Gin Ser 


He Gin Ala Ala Ala Ala 


Gly Phe 




260 






265 270 




Ala Ser 


Lys Thr 


Pro 


Ala Asn Gin 


Ala He Ser Met He Asp 


Gly Pro 
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280 


285 




Ala Pro 


Asp Gly 


Tyr 


Pro He He 


Asn Tyr Glu Tyr Ala He 


Val Asn 


290 






295 


300 




Asn Arg 


Gin Lys 


Asp 


Ala Ala Thr 


Ala Gin Thr Leu Gin Ala 


Phe Leu 


305 






310 


315 


320 


His Trp 


Ala He 


Thr 


Asp Gly Asn 


Lys Ala Ser Phe Leu Asp 


Gin Val 






325 




330 


335 


His Phe 


Gin Pro 


Leu 


Pro Pro Ala 


Val Val Lys Leu Ser Asp 


Ala Leu 




340 






345 350 




He Ala 


Thr He 


Ser 


Ser Gly Gly 


Gly Ser Gly Gly Gly Ser 


Gly Gly 




355 




360 


365 




Gly Ser 


Gly Gly 


Ser 


Val Pro Thr 


Thr Ala Ala Ser Pro Pro 


Ser Thr 


370 






375 


380 




Ala Ala 


Ala Pro 


Pro 


Ala Pro Ala 


Thr Pro Val Ala Pro Pro 


Pro Pro 


385 






390 


395 


400 


Ala Ala 


Ala Asn 


Thr 


Pro Asn Ala 


Gin Pro Gly Asp Pro Asn 


Ala Ala 






405 




410 


415 


Pro Pro 


Pro Ala 


Asp 


Pro Asn Ala 


Pro Pro Pro Pro Val He 


Ala Pro 




420 






425 430 




Asn Ala 


Pro Gin 


Pro 


Val Arg He 


Asp Asn Pro Val Gly Gly 


Phe Ser 




435 




440 


445 




Phe Ala 


Leu Pro Ala Gly Trp Val 


Glu Ser Asp Ala Ala His 


Phe Asp 


450 






455 


460 




Tyr Gly Ser Ala 


Leu 


Leu Ser Lys 


Thr Thr Gly Asp Pro Pro 


Phe Pro 


465 






470 


475 


480 


Gly Gin Pro Pro 


Pro 


Val Ala Asn 


Asp Thr Arg He Val Leu 


Gly Arg 






485 




490 


495 


Leu Asp Gin Lys 


Leu 


Tyr Ala Ser 


Ala Glu Ala Thr Asp Ser 


Lys Ala 




500 






505 510 




Ala Ala Arg Leu Gly Ser Asp Met 


Gly Glu Phe Tyr Met Pro 


Tyr Pro 




515 




520 


525 




Gly Thr Arg He 


Asn 


Gin Glu Thr 


Val Ser Leu Asp Ala Asn 


Gly Val 



530 535 540 
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Ser Gly Ser Ala Ser Tyr Tyr Glu 
545 550 
Pro Asn Gly Gin He Trp Thr Gly 
565 

Ala Pro Asp Ala Gly Pro Pro Gin 
580 

Thr Ala Asn Asn Pro Val Asp Lys 
595 600 
Ser lie Arg Pro Leu Val Ala Pro 

610 615 
Ala Glu Pro Ala Pro Ala Pro Ala 
625 630 
Pro Thr Thr Pro Thr Pro Gin Arg 
645 



Val Lys Phe Ser Asp Pro Ser Lys 
555 " 560 

Val He Gly Ser Pro Ala Ala Asn 

570 575 
Arg Trp Phe Val Val Trp Leu Gly 
585 590 
Gly Ala Ala Lys Ala Leu Ala Glu 
605 

Pro Pro Ala Pro Ala Pro Ala Pro 
620 

Pro Ala Gly Glu Val Ala Pro Thr 
635 640 
Thr Leu Pro Ala 
650 



